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ENZYMATIC CONVERSION OF GDP-MANNOSE TO GDP-FUCOSE 



BACKGROUND OF THE INVENTION 

Field of the Invention 

The present invention relates to the synthesis of oligosaccharides. In 
particular, it relates to improved enzymatic synthesis of GDP-fucose, which can be used in 
fucosylation reactions. The methods make possible the synthesis of complex fucosylated 
oligosaccharides in a single vessel using readily available starting materials. 

Background 

Increased understanding of the role of carbohydrates as recognition elements 
on the surface of cells has led to increased interest in the production of carbohydrate 
molecules of defined structure. For instance, compounds comprising the sialyl Lewis 
ligands, sialyl Lewis x and sialyl Lewis 8 are present in leukocyte and non-leukocyte ceil lines 
that bind to receptors such as the ELAM-1 and GMP 140 receptors. Polley et aL, Proc. Natl 
Acad ScL, USA, 88: 6224 (1991) and Phillips et al. y Science, 250: 1 130 (1990), see, also, 
USSN 08/063,181. 

Because of interest in making desired carbohydrate structures, 
glycosyltransferases and their role in enzyme-catalyzed synthesis of carbohydrates are 
presently being extensively studied. These en2ymes exhibit high specificity and are useful in 
forming carbohydrate structures of defined sequence. Consequently, glycosyltransferases are 
increasingly used as enzymatic catalysts in synthesis of a number of carbohydrates used for 
therapeutic and other purposes. 

In the application of enzymes to the field of synthetic carbohydrate chemistry, 
the use of glycosyltransferases for enzymatic synthesis of carbohydrate offers advantages 
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over chemical methods due to the virtually complete stereoselectivity and linkage specificity 
offered by the enzymes (Ito et al, Pure Appl Chern., 65:753 (1 993); and U.S. Patents 
5,352,670, and 5,374,541). However, the commercial-scale production of carbohydrate 
compounds is often complicated by the cost and difficulty in obtaining reactants that are 
5 used in the enzymatic and chemical synthesis of the carbohydrates. 

Improved methods for enzymatic synthesis of carbohydrate compounds, and 
precursors used in these syntheses, would advance the production of a number of beneficial 
compounds. The present invention fulfills these and other needs. 

SUMMARY OF THE INVENTION 

10 The present invention provides methods, expression vectors, and reaction 

mixtures that are useful for the efficient production of fiicosylated oligosaccharides. The 

invention provides ways by which nucleotide sugars such as GDP-fucose can be formed 

relatively inexpensively. 

In a first embodiment, the invention provides expression vectors that include 
1 5 a promoter operably linked to a nucleic acid that encodes a prokaryotic enzyme that has both 

an epimerase and a reductase activity. These two activities catalyze the conversion of GDP- 

4-keto-6-deoxymannose to GDP-fucose. 

In another embodiment, the invention provides reaction mixtures for 

synthesizing GDP-fucose. The reaction mixtures include GDP-4-keto-6-deoxymannose, 
20 N ADPH or NADH, and a prokaryotic enzyme that has both an epimerase and a reductase 

activity. The prokaryotic enzyme is one that can catalyze the conversion of GDP-4-keto-6- 
' deoxymannose to GDP-fucose. In a presently preferred embodiment, the GDP-4-keto-6- 

deoxymannose is formed by: a) providing a reaction mixture that comprises GDP-mannose, 

GDP-mannose-4,6-dehydratase, and NADP + ; and b) incubating the reaction mixture for a 
25 sufficient time to convert at least about 90% of the GDP-mannose to GDP-4-keto-6- 

deoxymannose. 

Another embodiment of the invention provides methods for the enzymatic 
conversion of GDP-mannose to GDP-fucose. These methods involve: 

a) providing a reaction mixture that comprises GDP-mannose, GDP- 
30 mannose 4,6-dehydratase (GMD), and NADP + ; 
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b) incubating the reaction mixture for a sufficient time to convert at least 
about 90% of the GDP-mannose to GDP-4-keto-6-deoxymannose; 

c) adding to the reaction mixture one or more polypeptides having GDP- 
4-keto-6-deoxymannose 3,5-epimerase and GDP-4-keto-6-galactose reductase activities; and 

5 d) incubating the reaction mixture for a sufficient time to convert the 

GDP-4-keto-6-deoxymannose to GDP-fucose. 

Also provided are methods for enzymatic synthesis of a fucosylated 
oligosaccharide. These methods involve transferring a fucose from the GDP-fucose produced 
by the methods of the invention to an acceptor saccharide. This can be accomplished by the 

10 following additional steps: e) adding a fucosyltransferase and the acceptor saccharide to the 
GDP-4-keto-6-deoxymannose produced in step b) or to the GDP-fucose produced in step d); 
and f) incubating a reaction mixture for a sufficient time to transfer the fucose from the 
GDP-fucose to the acceptor saccharide. 

Additional embodiments provide methods by which one can generate GDP- 

15 fucose starting from mannose. These methods involve the use of an enzymatic system for 
converting mannose into GDP-mannose, which is then converted to GDP-fucose using the 
above methods. The conversion of mannose to GDP-mannose involves the following 
enzymes: hexokinase, which converts mannose to mannose-6-phosphate; 
phosphomannomutase, which converts the mannose-6-phosphate to mannose- 1 -phosphate; 

20 and GDP-mannose pyrophosphorylase, which converts the mannose- 1 -phosphate to GDP- 
mannose. 

Also provided by the invention are methods for the synthesis of a fucosylated 
oligosaccharide in which efficiency-enhancing steps are used. The methods involve 
contacting an acceptor saccharide with a fucosylation reaction mixture that comprises GDP- 
25 fucose and a fucosyltransferase which transfers fucose from the GDP-fucose to provide said 
fucosylated oligosaccharide, wherein the efficiency of said fucosylation is enhanced by one 
or more efficiency-enhancing steps selected from the group consisting of: 

1 ) forming said GDP-fucose by enzymatic conversion of GDP-mannose to 

GDP-fiicose by: " 
30 a) providing a reaction mixture that comprises GDP-mannose, GDP- 

mannose 4,6-dehydratase, and NADP + ; 
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b) incubating the reaction mixture for a sufficient time to convert at least 
about 90% of the GDP-mannose to GDP-4-keto-6-deoxymannose; 

c) adding to the product of step b) one or more polypeptides having 
GDP-4-keto-6-deoxymannose 3,5-epimerase and GDP-4-keto-6- 

5 galactose reductase activities; and 

d) incubating the reaction mixture for a sufficient time to convert the 
GDP-4-keto-6-deoxymannose to GDP-fucose; 

2) adding pyruvate kinase and a substrate for the pyruvate kinase to the 
fucosylation reaction mixture, wherein GDP produced as a result of the transfer of fucose 

1 0 from the GDP-fucose is converted to GTP; and 

3) conducting the fucosylation in a reaction medium that comprises a soluble 
divalent metal cation, wherein said medium is supplemented with said soluble divalent metal 
cation to maintain the concentration of said divalent metal cation between about 2 mM and 
about 75 mM. 



15 



BRIEF DESCRIPTION OF THE FIGURES 



Figure 1 shows a schematic diagram of the enzymatic conversion of GDP- 
mannose to GDP-fucose. 

Figure 2 shows an 8-15% SDS-polyacrylamide gel analysis of extracts from 
20 bacterial cells that harbor expression plasmids for enzymes involved in synthesis of GDP- 
fucose. Molecular weight standards are in the two outside lanes. Extracts are from cells 
containing the expression vectors pTGK, pTGK:YEF B, pTGKiwcaH, and pTGK:GMD. 
SDS-PAGE under reducing conditions. Lane 1, pTGK lysate; Lane 2, pTGK:YEF Bl 
lysate;Lane 3, pTGK: YEF B2 lysate; Lane 4, pTGK:GMD lysate. 
25 Figure 3 shows HPLC profiles of the reaction products obtained using the 

GDP-mannose dehydratase expressed from the expression vector pTGK:GMD. 

Figure 4 shows the results of an assay designed to test whether the activity of 
GMD is linear over time. 

Figure 5 shows that GDP-fucose is a potent inhibitor of GDP-mannose 4,6- 
30 dehydratase (GMD). 
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Figure 6 shows that wcaU and human Fx are expressed by the corresponding 
expression vectors in E. coli. The analysis was performed using 8-15% SDS-PAGE under 
reducing conditions. 

Figure 7 A shows the results of an assay for GDP-mannose dehydratase 
activity as the first step of a coupled assay for determining the activity of YEF B. Figure 7 
shows the effect of adding YEF B to the reaction mixture after completion of the GDP- 
mannose dehydratase reaction. 

Figure 8 shows the results of a time course of the coupled GMD-YEF B 

assay. 

Figure 9 shows the results of a large-scale conversion of GDP-mannose to 
GDP-fiicose using GMD and YEF B. 

Figure 10 shows the formation of sialyl Lewis X<as indicated by increasing 
GDP) over time. The decrease in pH of the reaction mixture that occurs as a result of the 
reaction is also shown. 

Figure 1 1 shows the change in concentration of Mg 2 * over time during the 
synthesis of sialyl Lewis X. 

Figure 12 shows a time course of a large-scale sialyl Lewis X synthesis 
reaction. Samples were taken from the reaction mixture and analyzed by TLC. From left to 
right, the lanes are as follows: Lane 1, reaction at 160 hours; Lane 2, 147 hours; Lane 3, 
SLN; Lane 4, 43 hours; Lane 5, 1 hour. 

Figure 13 shows an NMR analysis of the final product obtained in the large- 
scale fucosylation reaction that used Neu5Aca2,3Galpl,4GlcNAc(31,3Gal-OR as a substrate. 

Figure 14 shows a schematic diagram of a GDP-mannose half-cycle 
fucosyltransferase reaction. 

Figure 15 shows a TLC analysis of the products of a GDP-mannose 
fucosyltransferase half-cycle reaction. From left to right: Lane 1, SLN; Lane 2, reaction at 2 
days; Lane 3, sialyl Lewis X. 

Figure 16 shows a schematic diagram of a mannose fucosyltransferase full- 
cycle reaction. 
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DETAILED DESCRIPTION 

Definitions 

Unless defined otherwise, all technical and scientific terms used herein have 
the same meaning as commonly understood by one of ordinary skill in the art to which this 
5 invention belongs. Singleton et al (1 994) Dictionary of Microbiology and Molecular 
Biology, second edition, John Wiley and Sons (New York) provides one of skill with a 
general dictionary of many of the terms used in this invention. Although any methods and 
materials similar or equivalent to those described herein can be used in the practice or testing 
of the present invention, the preferred methods and materials are described. For purposes of 
1 0 the present invention, the following terms are defined below. 

The following abbreviations for carbohydrate compounds are used herein: 
Ara = arabinosyl; 

Fru = fructosyl; 

Fuc = fucosyl; 

15 Gal = galactosyl; 

GalN Ac = N-acety lgalactosaminyl ; 
Glc = glucosyl; 

GlcNAc = N-acety lglucosaminyl; 
Man = mannosyl; and 

20 Sia (NeuAc) = sialyl (typically N-acetylneuraminyl). 

Oligosaccharides are considered to have a reducing end and anon-reducing 
end, whether or not the saccharide at the reducing end is in fact a reducing sugar. In 
accordance with accepted nomenclature, oligosaccharides are depicted herein with the non- 
reducing end on the left and the reducing end on the right. All oligosaccharides described 
25 herein are described with the name or abbreviation for the non-reducing saccharide {e.g. , 
Gal), followed by the configuration of the glycosidic bond (a or P), the ring bond, the ring 
position of the reducing saccharide involved in the bond, and then the name or abbreviation 
of the reducing saccharide (e.g., GlcNAc). The linkage between two sugars may be 
expressed, for example, as 2,3, 2-»3, or (2,3). Each saccharide is a pyranose. 
30 The term "sialic acid" refers to any member of a family of nine-carbon 

carboxylated sugars. The most common member of the sialic acid family is N-acetyl- 
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neuraminic acid (2-keto-5-acetamindo-3,5-dideoxy-D-glycero-D-galactononulopyranos-l- 
onic acid (often abbreviated as NeuSAc, NeuAc, or NANA). A second member of the 
family is N-glycolyl-neuraminic acid (NeuSGc or NeuGc), in which the N-acetyl group of 
NeuAc is hydroxylated. A third sialic acid family member is 2-keto-3-deoxy-nonulosonic 
5 acid (KDN) (Nadano et al (1986) J. Biol Chem. 261 : 1 1550-1 1557; Kanamori et al (1990) 
J. Biol Chem. 265: 2181 1-21819. Also included are 9-substituted sialic acids such as a 9-0- 
C,-C 6 acyl-Neu5Ac like 9-0-lactyl-Neu5Ac or 9-0-acetyl-Neu5Ac, 9-deoxy-9-fluoro- 
NeuSAc and 9-azido-9-deoxy-Neu5Ac. For review of the sialic acid family, see, e.g., Varki 
(1992) Glycobiology 2: 25-40; Sialic Acids: Chemistry, Metabolism and Function, R. 

1 0 Schauer, Ed. (Springer-Verlag, New York (1992)). The synthesis and use of sialic acid 

compounds in a sialylation procedure is disclosed in international application WO 92/16640, 
published October 1, 1992. 

The term "nucleic acid" refers to a deoxyribonucleotide or ribonucleotide 
polymer in either single- or double-stranded form, and unless otherwise limited, 

15 encompasses known analogues of natural nucleotides that hybridize to nucleic acids in 

manner similar to naturally occurring nucleotides. Unless otherwise indicated, a particular 
nucleic acid sequence includes the complementary sequence thereof. 

The term "operably linked" refers to functional linkage between a nucleic 
acid expression control sequence (such as a promoter, signal sequence, or array of 

20 transcription factor binding sites) and a second nucleic acid sequence, wherein the 

expression control sequence affects transcription and/or translation of the nucleic acid 
corresponding to the second sequence. 

The term "recombinant" when used with reference to a cell indicates that the 
cell replicates a heterologous nucleic acid, or expresses a peptide or protein encoded by a 

25 heterologous nucleic acid. Recombinant cells can contain genes that are not found within the 
native (non-recombinant) form of the cell. Recombinant cells can also contain genes found in 
the native form of the cell wherein the genes are modified and re-introduced into the cell by 
artificial means. The term also encompasses cells that contain a nucleic acid endogenous to 
the cell that has been modified without removing the nucleic acid from the cell; such 

30 modifications include those obtained by gene replacement, site-specific mutation, and related 
techniques. A "recombinant polypeptide" is one which has been produced by a recombinant 
cell. 
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A "recombinant expression cassette" or simply an "expression cassette" is a 
nucleic acid construct, generated recombinantly or synthetically, with nucleic acid elements 
that are capable of effecting expression of a structural gene in hosts compatible with such 
control elements. Expression cassettes include at least promoters and optionally, 
5 transcription termination signals. Typically, the recombinant expression cassette includes a 
nucleic acid to be transcribed (e.g., a nucleic acid encoding a desired polypeptide), and a 
promoter (e.g., a dual promoter that contains a tac promoter component and a gal promoter 
component as described in PCT/US97/20528; Int'l. Publ. No. WO 98201 1 1) that is operably 
linked to the nucleic acid. Additional factors necessary or helpful in effecting expression can 

1 0 also be used as described herein. For example, an expression cassette can also include 

nucleotide sequences that encode a signal sequence that directs secretion of an expressed 
protein from the host cell. Transcription termination signals, enhancers, and other nucleic 
acid sequences that influence gene expression, can also be included in an expression cassette. 

A "heterologous sequence" or a "heterologous nucleic acid", as used herein, 

15 is one that originates from a source foreign to the particular host cell (e.g., from a different 
species), or, if from the same source, is modified from its original form. Thus, a heterologous 
nucleic acid operably linked to a promoter is from a source different from that from which 
the promoter was derived, or, if from the same source, is modified from its original form. For 
example, a UDPglucose 4-epimerase gene promoter can be linked to a structural gene 

20 encoding a polypeptide other than native UDPglucose 4-epimerase. A heterologous gene that 
encodes an enzyme involved in conversion of GDP-mannose to GDP-fucose, for example, in 
a prokaryotic host cell includes a gene that is endogenous to the particular host cell that has 
been modified. Modification of the heterologous nucleic acid can occur, e.g., by treating the 
DNA with a restriction enzyme to generate a DNA fragment that is capable of being 

25 operably linked to the promoter. Techniques such as site-directed mutagenesis are also 
useful for modifying a heterologous nucleic acid. 

A "subsequence" refers to a sequence of nucleic acids or amino acids that 
comprise a part of a longer sequence of nucleic acids or amino acids (e.g., polypeptide) 
respectively. 

30 The term "isolated" is meant to refer to material which is substantially or 

essentially free from components which normally accompany the nucleic acid, polypeptide, 
or other molecule as found in its native state. Typically, isolated molecules are at least about 
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80% pure, usually at least about 90%, and preferably at least about 95% pure as measured 
by, e.g., band intensity on a silver stained gel or other method for determining purity. Protein 
purity or homogeneity can be indicated by a number of means well known in the art, such as 
polyacrylamide gel electrophoresis of a protein sample, followed by visualization upon 
5 staining. For certain purposes high resolution will be needed and HPLC or a similar means 
for purification utilized. 

The terms "identical" or percent "identity," in the context of two or more 
nucleic acids or polypeptide sequences, refer to two or more sequences or subsequences that 
are the same or have a specified percentage of amino acid residues or nucleotides that are the 

1 0 same, when compared and aligned for maximum correspondence, as measured using one of 
the following sequence comparison algorithms or by visual inspection. 

The phrase "substantially identical," in the context of two nucleic acids or 
polypeptides, refers to two or more sequences or subsequences that have at least 60%, 
preferably 80%, most preferably 90-95% nucleotide or amino acid residue identity, when 

15 compared and aligned for maximum correspondence, as measured using one of the following 
sequence comparison algorithms or by visual inspection. Preferably, the substantial identity 
exists over a region of the sequences that is at least about 50 residues in length, more 
preferably over a region of at least about 100 residues, and most preferably the sequences are 
substantially identical over at least about 1 50 residues. In a most preferred embodiment, the 

20 sequences are substantially identical over the entire length of the coding regions. 

For sequence comparison, typically one sequence acts as a reference 
sequence, to which test sequences are compared. When using a sequence comparison 
algorithm, test and reference sequences are input into a computer, subsequence coordinates 
are designated, if necessary, and sequence algorithm program parameters are designated. 

25 The sequence comparison algorithm then calculates the percent sequence identity for the test 
sequence(s) relative to the reference sequence, based on the designated program parameters. 

Optimal alignment of sequences for comparison can be conducted, e.g., by the 
local homology algorithm of Smith & Waterman, Adv. Appl Math. 2:482 (1981), by the 
homology alignment algorithm of Needleman & Wunsch, J. Mol Biol. 48:443 (1970), by the 

30 search for similarity method of Pearson & Lipman, Proc. Nat 'I Acad. Sci. USA 85:2444 

(1988), by computerized implementations of these algorithms (GAP, BESTFIT, FASTA, and 
TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group, 575 
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Science Dr., Madison, WI), or by visual inspection (see generally, Current Protocols in 
Molecular Biology, F.M. Ausubel et ah, eds., Current Protocols, a joint venture between 
Greene Publishing Associates, Inc. and John Wiley & Sons, Inc., (1995 Supplement) 
(Ausubel)). 

5 Examples of algorithms that are suitable for determining percent sequence 

identity and sequence similarity are the BLAST and BLAST 2.0 algorithms, which are 
described in Altschul et ah (1990) J. Mol Biol 215: 403-410 and Altschuel et ah (1997) 
Nucleic Acids Res. 25: 3389-3402, respectively. Software for performing BLAST analyses is 
publicly available through the National Center for Biotechnology Information 

10 (http://www.ncbi.nlm.nih.gov/). This algorithm involves first identifying high scoring 

sequence pairs (HSPs) by identifying short words of length W in the query sequence, which 
either match or satisfy some positive- valued threshold score T when aligned with a word of 
the same length in a database sequence. T is referred to as the neighborhood word score 
threshold (Altschul et al 9 supra). These initial neighborhood word hits act as seeds for 

1 5 initiating searches to find longer HSPs containing them. The word hits are then extended in 
both directions along each sequence for as far as the cumulative alignment score can be 
increased. Cumulative scores are calculated using, for nucleotide sequences, the parameters 
M (reward score for a pair of matching residues; always > 0) and N (penalty score for 
mismatching residues; always < 0). For amino acid sequences, a scoring matrix is used to 

20 calculate the cumulative score. Extension of the word hits in each direction are halted when: 
the cumulative alignment score falls off by the quantity X from its maximum achieved value; 
the cumulative score goes to zero or below, due to the accumulation of one or more negative- 
scoring residue alignments; or the end of either sequence is reached. The BLAST algorithm 
parameters W, T, and X determine the sensitivity and speed of the alignment. The BLASTN 

25 program (for nucleotide sequences) uses as defaults a wordlength (W) of 1 1 , an expectation 
(E) of 10, M=5, N— 4, and a comparison of both strands. For amino acid sequences, the 
BLASTP program uses as defaults a wordlength (W) of 3, an expectation (E) of 10, and the 
BLOSUM62 scoring matrix (see Henikoff & Henikoff, Proc. Natl Acad Sci. USA 89:10915 
(1989)). 

30 In addition to calculating percent sequence identity, the BLAST algorithm 

also performs a statistical analysis of the similarity between two sequences (see t e.g., Karlin 
& Altschul, Proc. Nat 'I Acad Sci. USA 90:5873-5787 (1993)). One measure of similarity 
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provided by the BLAST algorithm is the smallest sum probability (P(N)), which provides an 
indication of the probability by which a match between two nucleotide or amino acid 
sequences would occur by chance. For example, a nucleic acid is considered similar to a 
reference sequence if the smallest sum probability in a comparison of the test nucleic acid to 
5 the reference nucleic acid is less than about 0.1 , more preferably less than about 0.01, and 
most preferably less than about 0.001 . 

A further indication that two nucleic acid sequences or polypeptides are 
substantially identical is that the polypeptide encoded by the first nucleic acid is 
immunologically cross reactive with the polypeptide encoded by the second nucleic acid, as 

10 described below. Thus, a polypeptide is typically substantially identical to a second 

polypeptide, for example, where the two peptides differ only by conservative substitutions. 
Another indication that two nucleic acid sequences are substantially identical is that the two 
molecules hybridize to each other under stringent conditions, as described below. 

The phrase "hybridizing specifically to", refers to the binding, duplexing, or 

1 5 hybridizing of a molecule only to a particular nucleotide sequence under stringent conditions 
when that sequence is present in a complex mixture (e.g., total cellular) DNA or RNA. 

The term "stringent conditions" refers to conditions under which a probe will 
hybridize to its target subsequence, but to no other sequences. Stringent conditions are 
sequence-dependent and will be different in different circumstances. Longer sequences 

20 hybridize specifically at higher temperatures. Generally, stringent conditions are selected to 
be about 5°C lower than the thermal melting point (Tm) for the specific sequence at a 
defined ionic strength and pH. The Tm is the temperature (under defined ionic strength, pH, 
and nucleic acid concentration) at which 50% of the probes complementary to the target 
sequence hybridize to the target sequence at equilibrium. (As the target sequences are 

25 generally present in excess, at Tm, 50% of the probes are occupied at equilibrium). 

Typically, stringent conditions will be those in which the salt concentration is less than about 
1 .0 M Na ion, typically about 0.01 to 1.0 M Na ion concentration (or other salts) at pH 7.0 to 
8.3 and the temperature is at least about 30°C for short probes (e.g., 1 0 to 50 nucleotides) and 
at least about 60°C for long probes (e.g., greater than 50 nucleotides). Stringent conditions 

30 may also be achieved with the addition of destabilizing agents such as formamide. 

The phrases "specifically binds to a protein" or "specifically immunoreactive 
with", when referring to an antibody refers to a binding reaction which is determinative of 
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the presence of the protein in the presence of a heterogeneous population of proteins and 
other biologies. Thus, under designated immunoassay conditions, the specified antibodies 
bind preferentially to a particular protein and do not bind in a significant amount to other 
proteins present in the sample. Specific binding to a protein under such conditions requires 
5 an antibody that is selected for its specificity for a particular protein. A variety of 

immunoassay formats may be used to select antibodies specifically immunoreactive with a 
particular protein. For example, solid-phase ELISA immunoassays are routinely used to 
select monoclonal antibodies specifically immunoreactive with a protein. See, Harlow and 
Lane (1 988) Antibodies, A Laboratory Manual, Cold Spring Harbor Publications, New York, 

10 for a description of immunoassay formats and conditions that can be used to determine 
specific immunoreactivity. 

"Conservatively modified variations" of a particular polynucleotide sequence 
refers to those polynucleotides that encode identical or essentially identical amino acid 
sequences, or where the polynucleotide does not encode an amino acid sequence, to 

1 5 essentially identical sequences. Because of the degeneracy of the genetic code, a large 

number of functionally identical nucleic acids encode any given polypeptide. For instance, 
the codons CGU, CGC, CGA, CGG, AGA, and AGG all encode the amino acid arginine. 
Thus, at every position where an arginine is specified by a codon, the codon can be altered to 
any of the corresponding codons described without altering the encoded polypeptide. Such 

20 nucleic acid variations are "silent variations," which are one species of "conservatively 
modified variations." Every polynucleotide sequence described herein which encodes a 
polypeptide also describes every possible silent variation, except where otherwise noted. 
One of skill will recognize that each codon in a nucleic acid (except AUG, which is 
ordinarily the only codon for methionine) can be modified to yield a functionally identical 

25 molecule by standard techniques. Accordingly, -each "silent variation" of a nucleic acid 
which encodes a polypeptide is implicit in each described sequence. 

Furthermore, one of skill will recognize that individual substitutions, 
deletions or additions which alter, add or delete a single amino acid or a small percentage of 
amino acids (typically less than 5%, more typically less than 1%) in an encoded sequence are 

30 "conservatively modified variations" where the alterations result in the substitution of an 

amino acid with a chemically similar amino acid. Conservative substitution tables providing 
functionally similar amino acids are well known in the art. 
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One of skill will appreciate that many conservative variations of the GDP- 
fucose-synthesizing enzymes and nucleic acid that encode the enzymes yield essentially 
identical products. For example, due to the degeneracy of the genetic code, "silent 
substitutions" (i.e., substitutions of a nucleic acid sequence which do not result in an 
5 alteration in an encoded polypeptide) are an implied feature of every nucleic acid sequence 
which encodes an amino acid. As described herein, sequences are preferably optimized for 
expression in a particular host cell used to produce the enzyme {e.g., yeast, human, and the 
like). Similarly, "conservative amino acid substitutions," in one or a few amino acids in an 
amino acid sequence are substituted with different amino acids with highly similar 

10 properties, are also readily identified as being highly similar to a particular amino acid 
sequence, or to a particular nucleic acid sequence which encodes an amino acid. Such 
conservatively substituted variations of any particular sequence are a feature of the present 
invention. See also, Creighton (1984) Proteins , W.H. Freeman and Company. In addition, 
individual substitutions, deletions or additions which alter, add or delete a single amino acid 

15 or a small percentage of amino acids in an encoded sequence are also "conservatively 
modified variations". 

Description of the Preferred Embodiments 

The present invention provides methods and reaction mixtures for efficient 
production of substrates such as nucleotide sugars (e.g., GDP-fiicose) that are useful in 

20 synthesis of fucosylated carbohydrate compounds. Expression vectors for producing 

enzymes that are used in these methods are also provided. Methods for using these enzymes 
and methods for producing nucleotide sugars to produce fucosylated oligosaccharides are 
also provided by the invention. 

The methods of the invention provide significant advantages over previously 

25 available methods for fucosylation. For example, the methods provide a relatively 

inexpensive way to make GDP-fticose, an expensive compound, starting from the lower-cost 
GDP-mannose, or even from mannose. The invention provides methods by which the 
efficiency of the GDP-fucose synthesis, and also the efficiency of subsequent 
fucosyltransferase reactions, can be improved. The methods of the invention provide a high 

30 yield of the desired fucosylated compound. Accordingly, the methods of the invention are 
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well suited for commercial-scale production of fucosylated compounds, including those that 
are useful for diagnostic and therapeutic uses, foodstuffs, and the like. 

A. Cloning and Expression of Nucleic A cids Encoding Enzymes useful for 
Conversion of GDP-mannose to GDP-fucose 

5 The invention provides, in a first embodiment, methods of producing 

enzymes that are useful for converting GDP-mannose to GDP-fucose. This biosynthetic 

pathway, which is diagrammed in Figure 1, involves three enzymatic activities. The first 

enzyme, GDP-mannose dehydratase catalyzes the conversion of GDP-mannose to GDP-4- 

keto-6-D-deoxymannose. This product is then epimerized to GDP-4-keto-6-L- 

10 deoxygalactose, which is in turn reduced to GDP-L-fucose by a 4' reductase. The latter two 
enzymatic activities (epimerase and reductase) are both found in the human Fx protein 
(Tonetti et al. (1996) J. Biol Chem. 271 : 27274-27279; GenBank Accession No. U58766). 

For use in commercial -scale enzymatic synthetic reactions, however, it is 
preferred to use enzymes that are readily produced in prokaryotes, which are much easier and 

1 5 more efficient to grow at large scale than mammalian cells. Mammalian enzymes are often 
not expressed in the proper form at high yields when genes for the mammalian enzymes are 
inserted into prokaryotic host cells. Thus, it was of great interest to obtain a prokaryotic 
enzyme or enzymes to catalyze the epimerization and reduction. However, prior to the 
instant invention, it was not known whether bacterial systems for GDP-fiicose synthesis 

20 required one or two separate polypeptides to catalyze the epimerization and reduction of 

GDP-4-keto-6-D-deoxymannose to GDP-fucose. The present invention provides this missing 
information, demonstrating that one enzyme catalyzes both of these activities. In E. coli, this 
enzyme is designated YEF B. The need to produce only one enzyme to catalyze two 
activities simplifies the development and scale-up of GDP-fucose production. 

25 Accordingly, the present invention provides methods and vectors for 

recombinant production of enzymes that are useful for producing GDP-fucose. Recombinant 
production of a polypeptide generally involves obtaining a DNA sequence that encodes the 
particular enzyme, modified as desired, placing the DNA in an expression cassette under the 
control of a particular promoter, expressing the protein in a host, isolating the expressed 

30 protein and, if required, renaturing the protein. More than one of the enzymes can be 
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expressed in the same host cells, either on the same expression vector or on more than one 
expression vector that is present in the cells. 

In one embodiment, the invention provides expression vectors that are useful 
in methods for producing enzymes involved in GDP-fiicose synthesis in a host cell. For 
5 example, expression vectors are provided that are useful for expressing YEF B of E. coli, 
which catalyzes the both the epimerization and reduction of GDP-4-keto-6-deoxymannose to 
obtain GDP-fiicose. The expression vectors can also express related enzymes, in particular 
those from other prokaryotes, that also have the dual epimerase/reductase activity. 

1. Epimerase/Reductase-Encoding Nucleic Acids 

1 o The expression vectors of the invention include a nucleic acid that encodes an 

enzyme that has both GDP-4-keto-6-D-deoxymannose epimerase activity and GDP-4-keto-6- 
L-deoxygalactose reductase activity. A prokaryotic enzyme is encoded by the nucleic acid in 
presently preferred embodiments. For example, one can use a nucleic acid that encodes a 
prokaryotic epimerase/reductase from any prokaryotic species, including E. coli. The 

1 5 enzyme can be, for example, substantially identical to an E. coli YEF B polypeptide. In some 
embodiments, the expression vectors include a nucleic acid that encodes an 
epimerase/reductase enzyme that has an amino acid sequence as shown in SEQ ID NO: 1 . 
The nucleic acids can also encode polypeptides that have conservative amino acid 
substitutions compared to the amino acid sequence of a native epimerase/reductase enzyme, 

20 such as the E. coli YEF B enzyme. Typically, the nucleic acids used in the expression 

vectors of the invention are at least about 75% identical to the nucleic acid sequence of the E. 
coli YEF B coding region as shown in GenBank Accession No. U38473 (nucleotides 10748 
to 1 1230). More preferably, the nucleic acids used in the expression vectors are at least about 
85% identical to the £. coli YEF B coding region, and still more preferably are at least about 

25 95% identical. Typically, a computerized algorithm such as BLAST is used for the 

comparison, preferably using default parameters. These percentage identities can be an 
overall value for the entire coding regions, or can refer to the percentage identity over a 
particular region of the coding regions. For example, in a presently preferred embodiment, 
the nucleic acids used in the expression vectors of the invention are at least about 85% 

30 identical to the E. coli YEF B-encoding nucleic acid over a region of at least 40 nucleotides 
in length, using a pairwise BLAST algorithm (BLASTN 2.0.6 as implemented by the 
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National Center for Biotechnology Information with the following parameters: Match: 1 ; 
Mismatch: -2, Gap open: 5; Gap extension: 2, x-dropoff: 50; expect: 10.00; wordsize: 1 1 ; 
with no filtering). 

The nucleic acids that encode the epimerase/reductase enzyme can be 
5 obtained using methods that are known to those of skill in the art. Suitable nucleic acids 
(e.g., cDNA, genomic, or subsequences (probes)) can be cloned, or amplified by in vitro 
methods such as the polymerase chain reaction (PCR), the ligase chain reaction (LCR), the 
transcription-based amplification system (TAS), the self-sustained sequence replication 
system (SSR). A wide variety of cloning and in vitro amplification methodologies are well- 

10 known to persons of skill. Examples of these techniques and instructions sufficient to direct 
persons of skill through many cloning exercises are found in Berger and Kimmel, Guide to 
Molecular Cloning Techniques, Methods in Enzymology 152 Academic Press, Inc., San 
Diego, CA (Berger); Sambrook et ah (1989) Molecular Cloning - A Laboratory Manual 
(2nd ed.) Vol. 1-3, Cold Spring Harbor Laboratory, Cold Spring Harbor Press, NY, 

15 (Sambrook et ah); Current Protocols in Molecular Biology, F.M. Ausubel et ah, eds., 
Current Protocols, a joint venture between Greene Publishing Associates, Inc. and John 
Wiley & Sons, Inc., (1994 Supplement) (Ausubel); Cashion et ah, U.S. patent number 
5,017,478; and Carr, European Patent No. 0,246,864. Examples of techniques sufficient to 
direct persons of skill through in vitro amplification methods are found in Berger, Sambrook, 

20 and Ausubel, as well as Mullis et ah, (1987) U.S. Patent No. 4,683,202; PCR Protocols A 
Guide to Methods and Applications (Innis et ah, eds) Academic Press Inc. San Diego, CA 
(1990) (Innis); Arnheim & Levinson (October 1, 1990) C&EN 36-47; The Journal OfNIH 
Research (1991) 3: 81-94; (Kwoh et ah (1989) Proc. Nath Acad. Set USA 86: 1 173; Guatelli 
etah (1990) Proc. Nath Acad. Sci. USA 87, 1874; Lomell et ah (1989) J. Clin. Chem., 35: 

25 1826; Landegren et ah, (1988) Science 241: 1077-1080; Van Brunt (1990) Biotechnology 8: 
291-294; Wu and Wallace (1989) Gene 4: 560; and Barringer et ah (1990) Gene 89: 117. 
Improved methods of cloning in vitro amplified nucleic acids are described in Wallace et ah, 
U.S. Pat. No. 5,426,039. 

Nucleic acids that encode enzymes having both epimerase and reductase 

30 activities, or subsequences of these nucleic acids, can be prepared by any suitable method as 
described above, including, for example, cloning and restriction of appropriate sequences. As 
ah example, one can obtain a nucleic acid that encodes a polypeptide that has both epimerase 
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and reductase activities by routine cloning methods. A nucleotide sequence of a gene that 
encodes an enzyme known to have both activities, such as a YEF B enzyme, can be used to 
provide probes that specifically hybridize to a gene that encodes a suitable enzyme in a 
genomic DNA sample, or to a mRNA in a total RNA sample (e.g., in a Southern or Northern 
5 blot). Suitable sequences are provided in, for example, GenBank or other sequence database. 

One suitable nucleotide sequence for use as a probe or in an expression vector 
of the invention is found in an E. coli gene cluster that encodes GDP-fucose-synthesizing 
enzymes as described by Stevenson et ah (1996) J. Bacteriol. 178: 4885-4893 (GenBank 
Accession No. U38473). This gene cluster had been reported to include an open reading 

1 0 frame for GDP-mannose dehydratase (nucleotides 8659-9780 in GenBank Accession No. 
U38473). Applicants discovered that this gene cluster also contains an open reading frame 
that encodes an enzyme that has both 3,5 epimerization and 4-reductase activities (Figure 1), 
and thus is capable of converting the product of the GDP-mannose dehydratase reaction 
(GDP-4-keto-6-deoxymannose) to GDP-fucose. This ORF, which is designated YEF B, 

15 wcaG, and fcl, is found at nucleotides 9783-10748 of GenBank Accession No. U38473. Prior 
to Applicants' discovery that YEF B encodes an enzyme having two activities, it was not 
known whether one or two enzymes were required for conversion of GDP-4-keto-6- 
deoxymannose to GDP-fucose by prokaryotes. 

The gene cluster from E. coli includes an additional ORF, which is designated 

20 wcaH (nucleotides 10748-1 1230 of GenBank Accession No. U38473). This small open 

reading frame, which encodes a GDP-mannose mannosyl hydrolase of 15 kd, is located just 
downstream of the YEF B coding region (designated wcaG). Each of these enzymes was 
expressed in bacteria as assessed by SDS-PAGE and ability to form GDP-fucose from GDP- 
mannose. 

25 J Once the target epimerase/reductase nucleic acid is identified, it can be 

isolated according to standard methods known to those of skill in the art (see, e.g., Sambrook 
et ah (1989) Molecular Cloning: A Laboratory Manual, 2nd Ed, Vols. 7-5, Cold Spring 
Harbor Laboratory; Berger and Kimmel (1987) Methods in Enzymology, Voh 152: Guide to 
Molecular Cloning Techniques, San Diego: Academic Press, Inc.; or Ausubel et ah (1987) 

30 Current Protocols in Molecular Biology, Greene Publishing and Wiley-Interscience, New 
York). ... 
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A nucleic acid that encodes a prokaryotic epimerase/reductase can also be 
cloned by detecting its expressed product by means of assays based on the physical, 
chemical, or immunological properties. For example, one can identify a cloned 
epimerase/reductase-encoding nucleic acid by the ability of a polypeptide encoded by the 
5 nucleic acid to catalyze the conversion of GDP-4-keto-6-deoxymannose to GDP-fucose. In a 
preferred method, reverse phase HPLC is used to determine the amounts of GDP-mannose, 
GDP-fucose, and optionally one or more intermediates (e.g., GDP-4-keto-6-deoxymannose 
and GDP-4-keto-6-deoxygalactose) at various times of reaction. Suitable assay conditions 
are described in the Examples. 

10 In one embodiment, epimerase/reductase-encoding nucleic acids can be 

cloned using DNA amplification methods such as polymerase chain reaction (PGR). Thus, 
for example, the nucleic acid sequence or subsequence is PCR amplified, using a sense 
primer containing one restriction site (e.g., Xbal) and an antisense primer containing another 
restriction site (e.g., HindUl). This will produce a nucleic acid encoding the desired 

1 5 epimerase/reductase amino acid sequence or subsequence and having terminal restriction 
sites. This nucleic acid can then be easily ligated into a vector containing a nucleic acid 
encoding the second molecule and having the appropriate corresponding restriction sites. 
Suitable PCR primers can be determined by one of skill in the art using the sequence 
information provided in GenBank or other sources. Appropriate restriction sites can also be 

20 added to the nucleic acid encoding the epimerase/reductase or amino acid subsequence by 
site-directed mutagenesis. The plasmid containing the epimerase/reductase-encoding 
nucleotide sequence or subsequence is cleaved with the appropriate restriction endonuclease 
and then ligated into an appropriate vector for amplification and/or expression according to 
standard methods. 

25 Examples of suitable primers suitable for amplification of GDP-fucose- 

synthesizing enzymes are shown in Table 1 ; each primer pair is designed to provide a 5' 
Xbal restriction site and a 3' HindUl site on the amplified fragment. The plasmid containing 
the enzyme-encoding sequence or subsequence is cleaved with the appropriate restriction 
endonuclease and then ligated into an appropriate vector for amplification and/or expression 

30 according to standard methods. 
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Enzyme 


5' Primer 


3' Primer 


GDP-mannose 
dehydratase (E. 
coli) 


5 '-CGCTCTAGATACATGT 
C AAAAGTCGCT-3 ' 


5 '-GCGAAGCTITI ATGA 
CTCCAGCGCGAT-3' 


YEF B (E. coli) 


5 ' -CGTCCTAG AGCG AT 
GAGTAAACAACGAGTT-3 ' 


5'-GCGAAGCTTTTACCCCC 
GAAAGCGGTC-3 ' 


Wca H (E. coli) 


5 ' -GCTCTAG AGTAATG A 
TG 1THTACGTCAGG-3' 


5 '-CCCAAGCTTTCATAAT 
CCGGGTACTCCGGT-3 ' 


Fx (human) 


5'-GCTCTAGAGACATG 
GGTGAACCCCAGGGAT-3 ' 


5'-ACGAAGCTTCACTTCC 
GGGCCTGCTCGTAGTTG-3 ' 



As an alternative to cloning an epimerase/reductase-encoding nucleic acid, a 
suitable nucleic acid can be chemically synthesized from a known sequence that encodes a 
5 YEF B polypeptide or a related enzyme that has both epimerase and reductase activities. 
Direct chemical synthesis methods include, for example, the phosphotriester method of 
Narang et al (1979) Meth Enzymol 68: 90-99; the phosphodiester method of Brown et al 
(1 979) Meth Enzymol 68: 1 09-1 5 1 ; the diethylphosphoramidite method of Beaucage et al 
(1981) Tetra. Lett., 22: 1859-1862; and the solid support method of U.S. Patent No. 

10 4,458,066. Chemical synthesis produces a single stranded oligonucleotide. This can be 

converted into double stranded DNA by hybridization with a complementary sequence, or by 
polymerization with a DNA polymerase using the single strand as a template. One of skill 
would recognize that while chemical synthesis of DNA is often limited to sequences of about 
1 00 bases, longer sequences may be obtained by the ligation of shorter sequences. 

15 Alternatively, subsequences may be cloned and the appropriate subsequences cleaved using 
appropriate restriction enzymes. The fragments may then be ligated to produce the desired 
DNA sequence. . 

A nucleic acid encoding a GDP-fucose-synthesizing enzyme can be identified 
by detecting its expressed product by means of assays based on the physical, chemical, or 
20 immunological properties. For example, one can identify a cloned GDP-fucose-synthesizing 
nucleic acid by the ability of a polypeptide encoded by the nucleic acid to/catalyze the 
conversion of GDP-mannose to GDP-fucose. Other physical properties of a polypeptide 
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expressed from a particular nucleic acid can be compared to properties of known YEF B-like 
polypeptides to provide another method of identifying epimerase/reductase-encoding nucleic 
acids. Alternatively, a putative epimerase/reductase gene can be mutated, and its role as an 
epimerase/reductase established by detecting a variation in the ability to produce GDP- 
5 fucose. 

In some embodiments, it may be desirable to modify the epimerase/reductase- 
encoding nucleic acids. One of skill will recognize many ways of generating alterations in a 
given nucleic acid construct. Such well-known methods include site-directed mutagenesis, 
PCR amplification using degenerate oligonucleotides, exposure of cells containing the 

10 nucleic acid to mutagenic agents or radiation, chemical synthesis of a desired oligonucleotide 
(e.g., in conjunction with ligation and/or cloning to generate large nucleic acids) and other 
well-known techniques. See, e.g., Giliman and Smith (1979) Gene 8:81-97, Roberts et al. 
(1987) Nature 328: 731-734. 

In a preferred embodiment, the recombinant nucleic acids present in the cells 

1 5 of the invention are modified to provide preferred codons which enhance translation of the 
nucleic acid in a selected organism (e.g., yeast preferred codons are substituted into a coding 
nucleic acid for expression in yeast). 

2. Expression Vectors and Methods for Expressing Enzymes Involved in 
GDP -Fucose Synthesis 

20 The invention provides expression cassettes, and expression vectors 

containing the cassettes, that are useful for expressing the GDP-fucose synthesizing 
enzymes, in particular polypeptides that have both epimerase and reductase activities. These 
expression cassettes include the nucleic acids encoding GDP-fucose-synthesizing enzymes, 
which are operably linked to a promoter that is functional in a desired host cell. The 

25 expression cassettes can also include other sequences involved in transcription, translation, 
and post-translational modification of the enzyme. Such sequences are described in more 
detail below. The invention also provides expression vectors, and host cells that comprise the 
recombinant nucleic acids described herein. 

Typically, the polynucleotide that encodes the enzyme involved in nucleotide 

30 sugar synthesis is placed under the control of a promoter that is functional in the desired host 
cell. An extremely wide variety of promoters are well known, and can be used in the vectors 
of the invention, depending on the particular application. Ordinarily, the promoter selected 
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depends upon the cell in which the promoter is to be active. Other expression control 
sequences such as ribosome binding sites, transcription termination sites and the like are also 
optionally included. Expression control sequences that are suitable for use in a particular 
host cell are often obtained by cloning a gene that is expressed in that cell. 

In another embodiment, the invention provides recombinant cells that are 
useful for producing enzymes that catalyze one or more steps in the conversion of GDP- 
mannose to GDP-fucose. The recombinant cells contain an expression cassette, preferably 
included on an expression vector (unless the expression cassette is integrated into the 
genome of the host cell). Host cells of the invention can be plant cells or microorganisms, 
such as, for example, yeast cells, bacterial cells, or fungal cells. Examples of suitable cells 
include, for example, Azotobacter sp. {e.g., A. vinelandii), Pseudomonas sp., Rhizobium sp., 
Erwinia sp., Escherichia sp. (e.g., E. coli), and Klebsiella sp., among many others. The cells 
can be of any of several genera, including Saccharomyces (e.g., S. cerevisiae), Candida (e.g., 
C. utilis, C. parapsilosis t C krusei, C. versatilis, C. lipolytica, C. zeylanoides, C. 
guilliermondii, G albicans, and C. humicola), Pichia (e.g., P.farinosa and P. ohmeri), 
Torulopsis (e.g., T. Candida, T. sphaerica, T. xylinus, T.famata, andT. versatilis), 
Debaryomyces (e.g.,D. subglobosus, D. cantarellii, D. globosus, D. hansenii, and D. 
japonicus), Zygosaccharomyces (e.g., Z. rouxii and Z. bailii), Kluyveromyces (e.g., K. 
marxianus), Hansenula (e.g., H. anomala and K jadinii), and Brettanomyces (e.g., B. 
lambicus and B. anomalus). 

A promoter and other control signals can be derived from a gene that is under 
investigation, or can be a heterologous promoter or other signal that is obtained from a 
different gene, or from a different species. Where continuous expression of a gene is desired, 
one can use a "constitutive" promoter, which is generally active under most environmental 
conditions and states of development or cell differentiation. Suitable constitutive promoters 
for use in plants include, for example, the cauliflower mosaic virus (CaMV) 35S 
transcription initiation region and region VI promoters, the l f - or 2 1 - promoter derived from 
T-DNA of A grobacterium tumefaciens, and other promoters active in plant cells that are 
known to those of skill in the art. Other suitable promoters include the full-length transcript 
promoter from Figwort mosaic virus, actin promoters, histone promoters, tubulin promoters, 
or the mannopine synthase promoter (MAS). Other constitutive plant promoters include 
various ubiquitin or polyubiquitin promoters derived from, inter alia, Arabidopsis (Sun and 
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Callis, Plant J., 1 1(5):1017-1027 (1997)), the mas, Mac or DoubleMac promoters (described 
in United States Patent No. 5,106,739 and by Comai et al 9 Plant Mol Biol 15:373-381 
(1990)) and other transcription initiation regions from various plant genes known to those of 
skill in the art. Such genes include for example, ACT 11 from Arabidopsis (Huang et al 9 
5 Plant Mol Biol 33:125-139 (1996)), Cat3 from Arabidopsis (GenBankNo. U43147, Zhong 
et ai 9 Mol Gen. Genet. 251 : 196-203 (1996)), the gene encoding stearoyl-acyl carrier 
protein desaturase from Brassica napus (Genbank No. X74782, Solocombe et al, Plant 
Physiol 104:1 167-1 176 (1994)), GPcl from maize (GenBank No. X15596, Martinez et al 9 
J. Mol Biol 208:551-565 (1989)), and Gpc2 from maize (GenBank No. U45855, Manjunath 

10 et al 9 Plant Mol Biol 33:97-1 12 (1997)). Useful promoters for plants also include those 
obtained from Ti- or Ri-plasmids, from plant cells, plant viruses or other hosts where the 
promoters are found to be functional in plants. Bacterial promoters that function in plants, 
and thus are suitable for use in the methods of the invention include the octopine synthetase 
promoter, the nopaline synthase promoter, and the manopine synthetase promoter. Suitable 

1 5 endogenous plant promoters include the ribulose-1 ,6-biphosphate (RUBP) carboxylase small 
subunit (ssu) promoter, the (a-conglycinin promoter, the phaseolin promoter, the ADH 
promoter, and heat-shock promoters. 

Promoters for use in E. coli include the T7, trp, or lambda promoters. A 
ribosome binding site and preferably a transcription termination signal are also provided. For 

20 eukaryotic cells, the control sequences typically include a promoter which optionally 

includes an enhancer derived from immunoglobulin genes, SV40, cytomegalovirus, etc., and 
a polyadenylation sequence, and may include splice donor and acceptor sequences. 

In yeast, convenient promoters include GAL 1-10 (Johnson and Davies (1984) 
Mol Cell Biol 4:1440-1448) ADH2 (Russell et al (1983) J. Biol. Chenu 258:2674-2682), 

25 PH05 (EMBOJ. (1982) 6:675-680), and MFct (Herskowitz and Oshima (1982) in The 
Molecular Biology of the Yeast Saccharomyces (eds. Strathern, Jones, and Broach) Cold 
Spring Harbor Lab., Cold Spring Harbor, N.Y., pp. 181-209). Another suitable promoter for 
use in yeast is the ADH2/GAPDH hybrid promoter as described in Cousens et al. 9 Gene 
61:265-275 (1987). For filamentous fungi such as, for example, strains of the fungi 

30 Aspergillus (McKnight et al 9 U.S. Patent No. 4,935,349), examples of useful promoters 
include those derived from Aspergillus nidulans glycolytic genes, such as the ADH3 
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promoter (McKnight et al,EMBOJ. 4: 2093 2099 (1985)) and the (pi A promoter. An 
example of a suitable terminator is the ADH3 terminator (McKnight et al.). 

In some embodiments, the polynucleotides are placed under the control of an 
inducible promoter, which is a promoter that directs expression of a gene where the level of 
expression is alterable by environmental or developmental factors such as, for example, 
temperature, pH, anaerobic or aerobic conditions, light, transcription factors and chemicals. 
Such promoters are referred to herein as "inducible" promoters, which allow one to control 
the timing of expression of the glycosyltransferase or enzyme involved in nucleotide sugar 
synthesis. For E. coli and other bacterial host cells, inducible promoters are known to those 
of skill in the art. These include, for example, the lac promoter. A particularly preferred 
inducible promoter for expression in prokaryotes is a dual promoter that includes a tac 
promoter component linked to a promoter component obtained from a gene or genes that 
encode enzymes involved in galactose metabolism (e.g., a promoter from a UDPgalactose 
4-epimerase gene (galE)). The dual tac-gal promoter, which is described in 
PCT/US97/20528 (Infl. Publ. No. WO 98201 11), provides a level of expression that is 
greater than that provided by either promoter alone. 

Inducible promoters for use in plants are known to those of skill in the art 
(see, e.g., references cited in Kuhlemeier et al (1987) Ann. Rev. Plant Physiol. 38:221), and 
include those of the 1,5-ribulose bisphosphate carboxylase small subunit genes of 
Arabidopsis thaliana (the "ssu" promoter), which are light-inducible and active only in 
photosynthetic tissue, anther-specific promoters (EP 344029), and seed-specific promoters 
of, for example, Arabidopsis thaliana (Krebbers et al. (1988) Plant Physiol. 87:859). 

Inducible promoters for other organisms are also well known to those of skill 
in the art. These include, for example, the arabinose promoter, the lacZ promoter, the 
metallothionein promoter, and the heat shock promoter, as well as many others. 

A construct that includes a polynucleotide of interest operably linked to gene 
expression control signals that, when placed in an appropriate host cell, drive expression of 
the polynucleotide is termed an "expression cassette." Expression cassettes that encode the 
YEF B and/or other enzyme involved in nucleotide sugar synthesis are often placed in 
expression vectors for introduction into the host cell. The vectors typically include, in 
addition to an expression cassette, a nucleic acid sequence that enables the vector to replicate 
independently in one or more selected host cells. Generally, this sequence is one that enables 
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the vector to replicate independently of the host chromosomal DNA, and includes origins of 
replication or autonomously replicating sequences. Such sequences are well known for a 
variety of bacteria. For instance, the origin of replication from the plasmid pBR322 is 
suitable for most Gram-negative bacteria. Alternatively, the vector can replicate by 
5 becoming integrated into the host cell genomic complement and being replicated as the cell 
undergoes DNA replication. A preferred expression vector for expression of the enzymes is 
in bacterial cells is pTGK, which includes a dual tac-gal promoter and is described in 
PCTYUS97/20528 (Int'l. Publ. No. WO 98201 1 1). 

The construction of polynucleotide constructs generally requires the use of 

10 vectors able to replicate in bacteria. A plethora of kits are commercially available for the 
purification of plasmids from bacteria. For their proper use, follow the manufacturer's 
instructions {see, for example, EasyPrepJ, FlexiPrepJ, both from Pharmacia Biotech; 
StrataCleanJ, from Stratagene; and, QIAexpress Expression System, Qiagen). The isolated 
and purified plasmids can then be further manipulated to produce other plasmids, and used to 

1 5 transfect cells. Cloning in Streptomyces or Bacillus is also possible. 

Selectable markers are often incorporated into the expression vectors used to 
construct the cells of the invention. These genes can encode a gene product, such as a 
protein, necessary for the survival or growth of transformed host cells grown in a selective 
culture medium. Host cells not transformed with the vector containing the selection gene 

20 will not survive in the culture medium. Typical selection genes encode proteins that confer 
resistance to antibiotics or other toxins, such as ampicillin, neomycin, kanamycin, 
chloramphenicol, or tetracycline. Alternatively, selectable markers may encode proteins that 
complement auxotrophic deficiencies or supply critical nutrients not available from complex 
media, e.g., the gene encoding D-alanine racemase for Bacilli. Often, the vector will have 

25 one selectable marker that is functional in, e.g., E. coli, or other cells in which the vector is 
replicated prior to being introduced into the target cell. A number of selectable markers are 
known to those of skill in the art and are described for instance in Sambrook et al. 9 supra. A 
preferred selectable marker for use in bacterial cells is a kanamycin resistance marker (Vieira 
and Messing, Gene 19: 259 (1982)). Use of kanamycin selection is advantageous over, for 

30 example, ampicillin selection because ampicillin is quickly degraded by p-lactamase in 
culture medium, thus removing selective pressure and allowing the culture to become 
overgrown with cells that do not contain the vector. 
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Suitable selectable markers for use in mammalian cells include, for example, 
the dihydrofolate reductase gene (DHFR), the thymidine kinase gene (TK), or prokaryotic 
genes conferring drug resistance, gpt (xanthine-guanine phosphoribosyltransferase, which 
can be selected for with mycophenolic acid; neo (neomycin phosphotransferase), which can 
5 be selected for with G4 1 8, hygromycin, or puromycin; and DHFR (dihydrofolate reductase), 
which can be selected for with methotrexate (Mulligan & Berg (1981) Proc. Nat 7. Acad, Sci. 
USA 78: 2072; Southern & Berg (1982)7. Mol Appl Genet, 1: 327). 

Selection markers for plant and/or other eukaryotic cells often confer 
resistance to a biocide or an antibiotic, such as, for example, kanamycin, G 418, bleomycin, 

10 hygromycin, or chloramphenicol, or herbicide resistance, such as resistance to chlorsulfuron 
or Basta. Examples of suitable coding sequences for selectable markers are: the neo gene 
which codes for the enzyme neomycin phosphotransferase which confers resistance to the 
antibiotic kanamycin (Beck et al (1982) Gene 19:327); the hyg gene, which codes for the 
enzyme hygromycin phosphotransferase and confers resistance to the antibiotic hygromycin 

1 5 (Gritz and Davies (1 983) Gene 25: 1 79); and the bar gene (EP 242236) that codes for 

phosphinothricin acetyl transferase which confers resistance to the herbicidal compounds 
phosphinothricin and bialaphos. 

Construction of suitable vectors containing one or more of the above listed 
components employs standard ligation techniques as described in the references cited above. 

20 Isolated plasmids or DNA fragments are cleaved, tailored, and re-ligated in the form desired 
to generate the plasmids required. To confirm correct sequences in plasmids constructed, 
the plasmids can be analyzed by standard techniques such as by restriction endonuctease 
digestion, and/or sequencing according to known methods. Molecular cloning techniques to 
achieve these ends are known in the art. A wide variety of cloning and in vitro amplification 

25 methods suitable for the construction of recombinant nucleic acids are well-known to 

persons of skill. Examples of these techniques and instructions sufficient to direct persons of 
skill through many cloning exercises are found in Berger and Kimmel, Guide to Molecular 
Cloning Techniques, Methods in Enzymology, Volume 152, Academic Press, Inc., San 
Diego, CA (Berger); and Current Protocols in Molecular Biology, F.M. Ausubel et al, cds., 

30 Current Protocols, a joint venture between Greene Publishing Associates, Inc. and John 
Wiley & Sons, Inc., (1998 Supplement) (Ausubel). 
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A variety of common vectors suitable for use in constructing the expression 
vectors of the invention are well known in the art. For cloning in bacteria, common vectors 
include pBR322 derived vectors such as pBLUESCRIPT™, and X-phage derived vectors. In 
yeast, vectors include Yeast Integrating plasmids (e.g., YIp5) and Yeast Replicating 
5 plasmids (the YRp series plasmids) and pGPD-2. Expression in mammalian cells can be 
achieved using a variety of commonly available plasmids, including pSV2, pBC12BI, and 
p91023, as well as lytic virus vectors (e.g., vaccinia virus, adeno virus, and baculovirus), 
episomal virus vectors (e.g., bovine papillomavirus), and retroviral vectors (e.g., murine 
retroviruses). 

1 0 Suitable methods for introducing the expression vectors into a chosen host 

cell are known to those of skill in the art. For example, the expression vectors can be 
introduced into prokaryotic cells, including E. coli, by calcium chloride transformation, and 
into eukaryotic cells by calcium phosphate treatment or electroporation. Other 
transformation methods are also suitable. 

1 5 Once expressed, the recombinant GDP-fucose-synthesizing enzymes can be 

purified according to standard procedures of the art, including ammonium sulfate 
precipitation, affinity columns, column chromatography, gel electrophoresis and the like 
(see, generally, R. Scopes, Protein Purification, Springer-Verlag, N.Y. (1982), Deutscher, 
Methods in Enzymology Vol 182: Guide to Protein Purification., Academic Press, Inc. 

20 N.Y. (1990)). Substantially pure compositions of at least about 90 to 95% homogeneity are 
preferred, and 98 to 99% or more homogeneity are most preferred. Once purified, partially 
or to homogeneity as desired, the polypeptides may then be used (e.g., as immunogens for 
antibody production). 

In some embodiments, however, the enzymes are used in an unpurified state, 

25 e.g., as a cell lysate. The recombinant cells of the invention are grown in culture to obtain a 
sufficient number of cells for use in a reaction of a desired scale. Methods and culture media 
for growth of the respective host cells are well known to those of skill in the art. Culture can 
be conducted in, for example, aerated spinner or shaking culture, or, more preferably, in a 
fermentor. Upon growth of the recombinant cells to a desired cell density, the cells are 

30 typically processed for use in the reaction mixtures and methods of the invention. For 

example, the cells are generally permeabilized or otherwise disrupted to allow entry of the 
soluble saccharide acceptors into the cells. The YEF B or other enzyme produced by the cells 
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can, in some situations, diffuse from the cells into the extracellular fluid. Methods of 
permeabilizing cells so as to not significantly degrade enzymatic activity are known to those 
of skill in the art. Cells can be subjected to concentration, drying, lyophilization, treatment 
with surfactants, ultrasonic treatment, mechanical disruption, enzymatic treatment, and the 
5 like. 

The treated cells are then used in a reaction mixture that contains additional 
reactants, known to those of skill in the art, that are necessary or desirable for the enzymatic 
activity of the GDP-fucose-synthesizing enzyme. The concentration of treated cells used in 
the reaction mixture is typically between about 0.1% (wet wt/vol) and 40% (wet wt/vol), 

1 0 more preferably between about 1 % (wet wt/vol) and about 20% (wet wt/vol), and most 

preferably between about 2% (wet wt/vol) and about 1 0% (wet wt/vol), or a corresponding 
amount of dry cells. 

One of skill in the art would recognize that after chemical synthesis, 
biological expression, or purification, the GDP-fucose-synthesizing enzyme(s) may possess a 

1 5 conformation substantially different than the native conformations of the constituent 

polypeptides. In this case, it may be necessary to denature and reduce the polypeptide and 
then to cause the polypeptide to re-fold into the preferred conformation. Methods of 
reducing and denaturing proteins and inducing re-folding are well known to those of skill in 
the art (See, Debinski et al (1993) J. Biol Chem., 268: 14065-14070; Kreitman and Pastan 

20 (1993) Bioconjug. Chem., 4: 581-585; and Buchner, et al, (1992) Anal Biochem., 205: 263- 
270). Debinski et al, for example, describe the denaturation and reduction of inclusion 
body proteins in guanidine-DTE. The protein is then refolded in a redox buffer containing 
oxidized glutathione and L-arginine. One significant advantage of using a prokaryotic 
enzyme such as YEF B in a prokaryotic expression system, such as is used in a preferred 

25 embodiment of the instant invention, is that denaturation and refolding are not usually 

required. Active prokaryotic epimerase/reductase is obtained in large quantities without these 
additional steps. 

One of skill would recognize that modifications can be made to the GDP- 
fucose-synthesizing enzymes without diminishing their biological activity. Some 
30 modifications may be made to facilitate the cloning, expression, or incorporation of the 

targeting molecule into a fusion protein. Such modifications are well known to those of skill 
in the art and include, for example, a methionine added at the amino terminus to provide an 



WO 99/36555 



PCT/US99/00893 



initiation site, or additional amino acids (e.g., poly His) placed on either terminus to create 
conveniently located restriction sites or termination codons or purification sequences. 

B* Enzymatic Conversion of GDP-Mannose to GDP-Fucose 

The invention also provides methods for enzymatically converting GDP- 
5 mannose to GDP-fucose. The methods often involve using one or more of GDP-mannose 
dehydratase (GMD) and epimerase/reductase enzymes. In presently preferred embodiments, 
the methods use a prokaryotic GMD enzyme, as well as a prokaryotic enzyme that has both 
epimerase and reductase activities. An example of a preferred enzyme that has both activities 
is the YEF B polypeptide of E. coli. In other embodiments, a eukaryotic enzyme such a 
1 0 human Fx protein can be used. The methods of the invention provide a means by which 

GDP-fucose can be produced efficiently and in high purity using relatively low-cost starting 
materials. 

Several embodiments of the invention provide methods by which the 
efficiency of the GDP-mannose to GDP-fucose reaction can be substantially increased. In 

1 5 previously known methods, the enzymes in the enzymatic pathway are used in a coupled 
reaction, in which GDP-4-keto-6-deoxymannose is produced and simultaneously 
epimerized/reduced to obtain GDP-fucose. However, the GMD enzyme that catalyzes the 
initial step of the synthetic pathway, the conversion of GDP-mannose to GDP-4-keto-6- 
deoxymannose, is strongly inhibited by GDP-fucose. At 50 \iM GDP-fucose, GDP-mannose 

20 dehydratase possesses only 1 0% of its catalytic activity (Broschat et al (1985) Eur. J. 

Biochem. 153: 397-401). To avoid the reduction in yield caused by this product inhibition, 
the methods of the invention involve, in some embodiments, allowing the first step of the 
pathway to proceed nearly or completely to completion before adding the second enzyme of 
the pathway (e.g., the epimerase/reductase). In presently preferred embodiments, the 

25 conversion of GDP-mannose to GDP-4-keto-6-deoxymannose is at least about 80% 

complete, more preferably at least about 90% complete, and still more preferably at least 
about 95% complete, at the time the subsequent enzyme or enzymes is added to the reaction 
mixture. The reaction mixtures for the GMD-catalyzed reaction require the presence of 
NADP\ In another embodiment, a fucosyltransferase and acceptor saccharide are added to 

30 the product of the GMD reaction, GDP-4-keto-6-deoxymannose, along with the 
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epimerase/reductase. The GDP-fucose formed by the epimerase/reductase is consumed by 
the fucosyltransferase, thus preventing the feedback inhibition of GMD. 

The invention also provides reaction mixtures for enzymatic synthesis of 
GDP-fucose. In some embodiments, the reaction mixtures include GDP-4-keto-6- 
5 deoxymannose, NADPH or NADH, and one or more polypeptides that have epimerase 
activity and reductase activity (e.g., YEF B or a suitable analog), the latter polypeptides 
preferably from a prokaryote. The epimerase/reductase polypeptide catalyzes the 
epimerization and reduction of GDP-4-keto-6-deoxymannose to form GDP-fucose. The 
GDP-4-keto-6-deoxymannose is preferably obtained by GMD-catalyzed conversion from 

1 0 GDP-mannose as described above. 

The epimerase/reduction to obtain GDP-fucose requires either NADPH or 
NADH. In some embodiments, one can employ a regenerating system by which the oxidized 
electron donor is regenerated to NADPH or NADH. For example, one can include in the 
reaction mixture an enzyme that utilizes NADP + or NAD + to oxidize a substrate for the 

1 5 enzyme, which substrate is also included in the reaction mixture. Examples of suitable 

recycling enzymes include, for example, alcohol dehydrogenase, glucose dehydrogenase, 
formate dehydrogenase, hydrogenase, and glucose-6-phosphate dehydrogenase (see, e.g., 
Wong et al (1985) J. Am. Chem. Soc. 107: 4028-4031). In a presently preferred 
embodiment, glucose dehydrogenase and its substrate glucose are used to regenerate 

20 NADPH or NADH. 

The invention also provides methods and reaction mixtures in which the 
GDP-fucose-synthesizing reactions are coupled to fucosyltransferase reactions in which the 
fucose residue is transferred from GDP-fucose to a suitable saccharide acceptor. A number 
of suitable fucosyltransferases are known to those of skill in the art. Briefly, 

25 fucosyltransferases include any of those enzymes which transfer L-fucose from GDP-fucose 
to a hydroxy position of an acceptor sugar. An example of an acceptable acceptor sugar is a 
GlcNAc in a pGal(l-*4)pGlcNAc group in an oligosaccharide glycoside. Suitable 
fucosyltransferases then include the known pGal(l->3,4)pGlcNAc 

a(l->3,4)fiicosyltransferase (FTIII E.C. No. 2.4.1.65) which can be obtained from human 
30 milk (see, Palcic, et al, Carbohydrate Res. 190:1-1 1 (1989); Prieels, et al, J. Biol. Chem. 
256:10456-10463 (1981); and Nunez, et aL, Can. J. Chem. 59:2086-2095 (1981)) and the 
pGal(l->4)pGlcNAc a(l -»3)focosyltransferases (FTIV, FTV, FTVI, and FTVII, E.C. No. 
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2.4.1 .65) which are found in human serum. A recombinant form of (3Gal(l -»3,4)pGlcNAc 
a(l-»3,4)fucosyltransferase is also available (see, Dumas, et al, Bioorg. Med. Letters 
1:425-428 (1991) and Kukowska-Latallo, ei ai, Genes and Development 4:1288-1303 
(1990)). Other exemplary fucosyltransferases include ctl,2 fucosyltransferase (E.C. No. 
5 2.4.1 .69); additional fucosyltransferases are listed in Table 2. Enzymatic fucosylation may be 
carried out by the methods described in Mollicone, et ai, Eur. J. Biochem. 191:169-176 
(1990) or U.S. Patent No. 5,374,655. 

The invention also provides methods for improving the efficiency of 
fucosyltransferase-mediated transfer of fucose from GDP-fucose to an acceptor saccharide. 

1 0 One problem with coupled reactions in which GDP-mannose is converted to GDP-fucose, 
which is then nearly simultaneously transferred to an acceptor saccharide, is that GDP 
inhibits both GMD and fucosyltransferase. Thus, in one embodiment, the methods of the 
invention involve improving the efficiency of fucosylated saccharide production by 
conducting the enzymatic synthesis of GDP-fiicose separately from the fucosyltransferase 

1 5 reaction. Free GDP is not released, so the GMD is not inhibited. 

In an alternative embodiment, the GDP produced in conjunction with the 
fucosyltransferase reaction is converted to GTP, which does not inhibit fucosyltransferase or 
GMD. For example, the reaction mixture can include a kinase and corresponding substrate. 
Suitable kinases include, for example, polyphosphate kinase (EC 2.7.4.1), nucleoside 

20 phosphate kinases (EC 2.7.4.4), creatine kinase (EC 2.7.3.2); myokinase (EC 2.7.4.3); 
myokinase (EC 2.7.4.3); acetyl kinase {e.g., EC 2.7.1.59); acetyl phosphate kinase; and 
pyruvate kinase (EC 2.7.1 .40). Suitable substrates for these enzymes are known to those of 
skill in the art. A reaction scheme that incorporates this embodiment is shown in Figure 9. 

In a preferred embodiment, the GDP-fucose synthesizing enzymes are used in 

25 an enzymatic cycle. Suitable enzymatic cycles include half-cycles such as is shown in Figure 
15, which uses GDP-mannose as the starting material, and full cycles, such as that shown in 
Figure 16, which uses mannose as the starting material. An example of a GDP-mannose half- 
cycle (Figure 15) uses glucose dehydrogenase to enzymatically recycle NADPH, a cofactor 
required by the epimerase/reductase polypeptides, and fucosyltransferase V to transfer fucose 

30 to SLN, resulting in formation of a sialyl-Lewis x compound. Since GDP inhibits both the 
fucosyltransferase and GDP-mannose dehydratase, a phosphatase that is able to cleave the 
GDP but does not cleave the phosphate from NADP is preferably. added to the cycle. MgCl 2 
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is preferred as a source of divalent cation, as YEF B is inhibited by MnCl 2 . In addition to 
GDP-mannose dehydratase and epimerase/reductase polypeptides, the GDP-mannose half- 
fucosyltransferase cycle preferably includes the following: an enzyme that recycles NADPH, 
such as glucose dehydrogenase, a substrate for the NADPH regenerating enzyme such as 
5 glucose or galactose, fucosyltransferase, an acceptor oligosaccharide or glycoconjugate, 
NADP, GDP-mannose and a buffer system. 

Alternatively, the enzymes can be used in a "mannose full cycle," as shown in 
Figure 16. GDP-mannose dehydratase and an epimerase/reductase polypeptide can be used 
in conjunction with GDP-mannose pyrophosphorylase, GDP-phosphomannomutase, 

1 0 hexokinase and pyruvate kinase in a complete or "full" mannose fucosyltransferase cycle. 
The full cycle allows one to use simply mannose and the desired acceptor saccharide as the 
starting material. Either recombinant or naturally occurring enzymes can be used. For full- 
cycle reactions, the reaction mixtures are preferably free of phosphatases. 

The reaction methods of the invention can be optimized by altering the 

1 5 divalent cations present in the reaction mixture, the pH of the reaction, temperature, and 

additives such as alkaline phosphatase that may improve the reaction kinetics by removal of 
inhibitory bi-products such as GDP. For example, as discussed above, GDP is known to 
- inhibit both GDP-mannose dehydratase and fucosyltransferase (Broschat et al (1985) Eur. J. 
Biochem. 153: 397-401; Staudacher, E. (1996) TIGG 8: 391-408). 

20 Further improvement in the fucosylation reactions can be obtained by 

supplementing the divalent cations present in the reaction mixture as the reaction proceeds. 
Glycosyltransferase cycles (See, U.S. Patent No. 5,374,541 and WO 9425615 A) typically 
produce one or more moles of inorganic pyrophosphate for each mole of product formed and 
are typically carried out in the presence of a divalent metal ion. The metal ion is a cofactor 

25 for at least one of the enzymes in each of the cycles. However, the divalent metal cations 

play a dual role in the glycosyltransferase cycles. In particular, the divalent metal ion forms a 
complex of very low solubility with inorganic phosphate or pyrophosphate produced by 
various processes in the glycosyltransferase cycles. As a result, the metal ion can remove the 
Pi or PPi from the reaction by precipitation. This, in turn, results in reduced amounts of 

30 metal ions present in solution and a corresponding decrease in the overall turnover rates for 
those enzymes which require the metal ion cofactors. To circumvent this problem, the 
methods of the invention can include adding sufficient divalent metal cation to the reaction 



WO 99/36555 



32 



PCT/US99/00893 



10 



medium to restore a portion of the divalent cation lost during the course of the reaction to 
thereby achieve or maintain a concentration of divalent metal cation in the reaction medium 
between about 1 mM and about 75 mM. Preferably, the addition of divalent metal cation 
occurs without interruption of said enzymatic conversion. See, e.g., International Patent 
Application Publ. No. W096/32491. 

Once the reactions of the invention are optimized for the GDP-mannose 
dehydratase and an epimerase/reductase such as YEF B, other analogs of these enzymatic 
activities can be tested using a similar process. For example, human Fx can be substituted for 
YEF B. 

Table 2: Fucosyltransferases that are useful in fucosyltransferase cycles. 



Fucosyltransferase 



Reference 



al,3/4Fuc-TIII 
al,3Fuc-TIV 
al,3Fuc-TV 
al,3Fuc-TVI 
al,3Fuc-TVII 
al,3Fuc-T bacterial 
al,2FucI 
al,2Fuc -Til 



Kukowska-Latallo et al (1990) Genes Dev. 4: 1288-1303 

Kumar et al (1991) J. Biol. Chem. 266: 21777-21783 

Weston et al (1992) J. Biol. Chem. 267: 4152-4160 

Weston et al (1992) J. Biol. Chem. 267: 24575-24584 

Natsuka et al (1994) J. Biol. Chem. 270: 201 12-20122 

Ge et al (1997) J. Biol. Chem. 272: 21357-21363 

Sarnesto et al (1992) J. Biol. Chem. 267: 2745-2752 

Larsen et al (1990) Proc. Nat 'I. Acad. Sci. USA 87: 6674- 
6678 



For glycosyltransferase cycles, the concentrations or amounts of the various 
1 5 reactants used in the processes depend upon numerous factors including reaction conditions 
such as temperature and pH value, and the choice and amount of acceptor saccharides to be 
glycosylated. Because the glycosylation process permits regeneration of activating 
nucleotides, activated donor sugars and scavenging of produced PPi in the presence of 
catalytic amounts of the enzymes, the process is limited by the concentrations or amounts of 
20 the stoichiometric substrates discussed before. The upper limit for the concentrations of 
reactants that can be used in accordance with the method of the present invention is 
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determined by the solubility of such reactants. Preferably, the concentrations of activating 
nucleotides, phosphate donor, the donor sugar and enzymes are selected such that 
glycosylation proceeds until the acceptor is consumed, thus completely sialylating the 
saccharide groups present on the glycoprotein. 

Enzyme amounts or concentrations are expressed in activity Units, which is a 
measure of the initial rate of catalysis. One activity Unit catalyzes the formation of 1 (irnol 
of product per minute at a given temperature (typically 37°C) and pH value (typically 7.5). 
Thus, 10 Units of an enzyme is a catalytic amount of that enzyme where 10 pmols of 
substrate are converted to 10 jimol of product in one minute at a temperature of 37°C and a 
pH value of 7.5. 

The above ingredients are typically combined by admixture in an aqueous 
reaction medium (solution). That medium has a pH value of about 6 to about 8.5. The 
medium is devoid of chelators that bind enzyme cofactors such as Mg +2 or Mn +2 . The 
selection of a medium is based on the ability of the medium to maintain pH value at the 
desired level. Thus, in some embodiments, the medium is buffered to a pH value of about 
7.5, preferably with HEPES. If a buffer is not used, the pH of the medium should be 
maintained at about 6 to 8.5, preferably about 7.2 to 7.8, by the addition of base. A suitable 
base is NaOH, preferably 6 M NaOH. 

The reaction medium may also comprise solubilizing detergents (e.g., Triton 
or SDS) and organic solvents such as methanol or ethanol, if necessary. The enzymes can be 
utilized free in solution or can be bound to a support such as a polymer. The reaction 
mixture is thus substantially homogeneous at the beginning, although some precipitate can 
form during the reaction. 

The temperature at which an above process is carried out can range from just 
above freezing to the temperature at which the most sensitive enzyme denatures. That 
temperature range is preferably about zero degrees C to about 45°C, or up to 100°C or more 
for an enzyme obtained from a thermophylic organism. A typical range for non-thermophilic 
enzymes is about 20°C to about 37°C. 

The reaction mixture so formed is maintained for a period of time sufficient 
for the desired percentage of acceptor sugar residues to be fiicosylated. For commercial- 
scale preparations, the reaction will often be allowed to proceed for about 8-240 hours, with 
a time of between about 24 and 48 hours being more typical. 
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D. Uses of GDP-Fucose synthesized using the methods of the invention 

The methods of the invention are useful for synthesis of GDP-fucose, which 
finds use in the synthesis of a wide variety of carbohydrate compounds of interest. The 
carbohydrate compounds can be oligosaccharide or polysaccharide structures, on 
5 glycoproteins, glycolipids, or free molecules. 

GDP-fucose produced using the methods of the invention can be used, for . 
example, for the synthesis of a wide variety of fucosylated carbohydrates. In conjunction 
with appropriate fiicosyltransferase enzymes, the following carbohydrate structures are 
among those that one can obtain using the GDP-fucose: (1) Fuca(l— >2)Galp-; (2) 
10 Gaip(1^3)[Fuca(l->4)]GlcNAcP-; (3) Gaip(l->4)[Fuca(l->3)]GlcNAcp-; (4) 

Gaip(W4)[Fuca(l-»3)]Glc; (5) -GlcNAcP(l-^4)[Fuca(l-^6)]GlcNAcpi-^Asn; (6) - 
GlcNAcp(l->4)[Fuca(l->3)GlcNAcpi-»Asn; (7) Fuca(l-»6)Gaip->; (8) 
Fucct(l->3)Gaip-; (9) GlcP(l->3)Fucal->0-Thr and Fucal->0-Thr/Ser; (10) 
Fucal — »Ceramide; and (11) Fuca(l -*3)Fuc. Examples of products that can be formed 
15 using GDP-fucose as a reactant include, but are not limited to, those listed in Table 3. 

Table 3: Oligosaccharide Structures Synthesized using GDP-fucose and Fucosyltransferase 



Oligosaccharide 


Tissue source 


III 3 Fucosyl-para-lacto-N-hexaose 


Human milk 


3'-Sialyl-3-fiicosyllactose 


Human milk 


Lewis X 


hematopoietic cells 


Lewis A 


hematopoietic cells 


Sialyl lewis X 


hematopoietic cells 


Sialyl lewis A 


hematopoietic cells 


Lacto-N-difucohexaose II 


Human milk 


Lacto-N-fucopentaose I 


Human milk 


Lacto-N-fucopentaose II 


Human milk 


2*-Fucosyllactose 


Human milk 


Lactodifiicotetraose 


Human milk 


3-Fucosyllactose 


Human milk 


Lacto-N-fucopentaose III 


Human milk 


Lacto-N-difucohexaose I 


Human milk 


Lacto-N-fucopentaose V 


Human milk 



For example, the present invention provides reaction mixtures and methods 
20 for the preparation of compounds having the formula Gaip(l -4)(Fucal -»3)Glc-R, Gaip(l -3) 
(Fuccxl-»4)Glc-R, Gaip(1^4)(Fucal^3)GlcN(R , )P-R and Gal£(l-*3XFucal->4) 
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GlcN(R')p-R. In these embodiments, the acceptor saccharide is Gaip(l-4)Glc-R, Gal(3(l- 
3)Glc-R, GaipCl^GlcNCROP-R or Gaip(l-»3)GlcN(R')P-R, respectively, wherein: 

R is selected from the group consisting of hydrogen, a saccharide, an 
oligosaccharide and an aglycon group having at least one carbon atom; and 
5 R ! is selected from the group consisting of acetyl and allyloxycarbonyl. 

Suitable acceptor saccharides also include, for example, NeuAca(2-»3)Gal 
P(l->4)GlcN(R , )P(l->3)Gaip-OR and NeuAca(2^3)Gaip(l-^3)GlcN(R , )p(l-^3)Gaip~OR. 
These acceptor saccharides can be formed by sialylating a compound Galp(l-^4)GlcN(R l ) 
P(l->3)Gaip-OR or Galp(l->3)GlcN(R') P(l-»3)Gaip-OR with a sialyltransferase in the 

10 presence of a CMP derivative of a sialic acid using a a(2,3)sialyltransferase under conditions 
wherein sialic acid is transferred to the non-reducing sugar of the compound. The 
compounds GalpO-^GlcNCR 1 ) P(l->3)Galp-OR and Gaip(l-»3)GlcN(R') P(l-»3)Gaip- 
OR can be formed by galactosylating a compound of the formula GlcN(R')p(l-»3)Galp-OR 
or GlcN(R')P(l->3)Galp-OR, respectively, with a galactosyl transferase in the presence of a 

15 UDP-galactose under conditions sufficient to form the compound. 

In some embodiments of the invention, methods are provided for synthesizing 
NeuAca(2^3)Gaip(1^4)(Fucal^3)GlcN(R0P(1^3)Gaip-ORorNeuAca(2-^3)Gal 
P(l-^3)(Fucal->4)GlcN(R , )P(l~>3)Gaip-OR. In these formulae, R is a hydrogen, a 
saccharide, an oligosaccharide or an aglycon group having at least one carbon atom. R* can 

20 be either acetyl or allyloxycarbonyl (Alloc). The term "aglycon group having at least one 

carbon atom" refers to a group — A-Z, in which A represents an alkylene group of from 1 to 
1 8 carbon atoms optionally substituted with halogen, thiol, hydroxy, oxygen, sulfur, amino, 
imino, or alkoxy; and Z is hydrogen, -OH, -SH, — NH 2 , -NHR\ -N(R ] ) 2 , -C0 2 H, -C0 2 R\ 
-CONH 2 , -CONHR 1 , -CONOR 1 )* -CONHNH 2 , or -OR 1 wherein «ach R 1 is independently 

25 alkyl of from 1 to 5 carbon atoms. In addition, R can be (CH 2 ) n CH(CH 2 ) m CH 3 

(CH 2 ) 0 CH 3 , 

where n,m,o =1-18; (CH 2 ) n -R 2 (in which n = 0-1 8), wherein R 2 is a variously substituted 
aromatic ring, preferably, a phenyl group, being substituted with one or more alkoxy groups, 
30 preferably methoxy or 0(CH 2 ) m CH 3 , (in which m = 0-18), or a combination thereof. 

The steps involved in synthesizing these compounds include: 
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(a) galactosylating a compound of the formula GlcNR'p(l->3)Gaip-OR 
with a galactosyltransferase in the presence of a UDP-galactose under conditions sufficient 
to form the compound: Gaip(1^4)GlcNR'p(1^3)Galp-OR; 

(b) sialylating the compound formed in (a) with a sialyltransferase in the 

5 presence of a CMP derivative of a sialic acid using a <x(2,3)sialyltransferase under conditions 
in which sialic acid is transferred to the non-reducing sugar to form the compound: 
NeuAca(2-^3)Galp(l-^4)GlcNR , p(l->3)Galp-OR; and 

(c) fucosylating the compound formed in (b) to provide the 
NeuAca(2->3)Galp(1^4)(Fucal->3)GlcNR , P(l-^3)Gaip-OR. 

10 The fucosylating step is preferably carried out using the methods of the 

invention, as described above. In some embodiments, lysates from recombinant cells that 
express one or more of the GDP-fiicose-synthesizing enzymes are used in the reactions. The 
glycosyltransferase reactions can also employ lysates of cells that produce the recombinant 
glycosyltransferases. In some embodiments, at least two of the reaction steps are carried out 

1 5 using recombinant cells that produce a glycosyltransferase or other enzyme or reactant. The 
different enzymes can be expressed by the same cell, or different recombinant cells which 
each contain an exogenous glycosyltransferase or GDP-fucose-synthesizing gene can be 
mixed together. Thus, by mixing and matching members of a set of recombinant cells, each 
of which contain a different nucleotide sugar-synthesizing gene, or glycosyltransferase, one 

20 can readily create a custom reaction mixture for performing many multi-step glycosylation 
reactions. 

In yet another aspect, the present invention provides methods for the 
preparation of compounds as described in WO 94/26760. Generally these compounds have 
the formula: 

25 NeuAca(2->3)Gaip(1^4)(Fucal->3)GlcN(R")P-OR 2 

In this formula, R M is alkyl or acyl from 1-18 carbons, 5,6,7,8 - tetrahydro-2-naphthamido; 
benzamido; 2-naphthamido; 4-amino benzamido; or 4-nitrobenzamido. R 2 may be the same 
as R as described above or may be Galp-OR (R is as described above). 

In the above descriptions, the terms are generally used according to their 

30 standard meanings. The term "alkyl" as used herein means a branched or unbranched, 

saturated or unsaturated, monovalent or divalent, hydrocarbon radical having from 1 to 20 
carbons, including lower alkyls of 1-8 carbons such as methyl, ethyl, n-propyl, butyl, n- 
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hexyl, and the like, cycloalkyls (3-7 carbons), cycloalkylmethyls (4-8 carbons), and 
arylalkyls. 

The term "aryl" refers to a radical derived from an aromatic hydrocarbon by 
the removal of one atom, e.g., phenyl from benzene. The aromatic hydrocarbon may have 
5 more than one unsaturated carbon ring, e.g., naphthyl. The term "alkoxy" refers to alkyl 

radicals attached to the remainder of the molecule by an oxygen, e.g., ethoxy, methoxy, or n- 
propoxy. The term "alkylthio" refers to alkyl radicals attached to the remainder of the 
molecule by a sulfur. 

The term of "acyl" refers to a radical derived from an organic acid by the 
10 removal of the hydroxyl group. Examples include acetyl, propionyl, oleoyl, myristoyl. 

The present invention is also useful for synthesizing a variety of compounds 
that comprise selectin-binding carbohydrate moieties. These selectin-binding moieties have 
the general formula: 

R'Gaip 1 ,m(Fuca 1 ,n)GlcNR°(R 2 ) — 
1 5 in which R° is (C,-C 8 alkyl)carbonyl, (C,-C 8 alkoxy )carbonyl, or (C 2 -C 9 

alkenyloxy)carbonyl, R 1 is an oligosaccharide or a group having the formula 



20 




C0 2 H 



R 3 and R 4 may be the same or different and may be H, C,-C 8 alkyl, hydroxy- 
25 (C,-C 8 alkyl), aryl-(C,-C 8 alkyl), or (C r C 8 alkoxy)-(C r C 8 alkyl), substituted or unsubstituted. 
R 2 may be H, C r C 8 alkyl, hydroxy-(C,-C 8 alkyl), aryl-(C r C 8 - alkyl), (C r C 8 alkyl)-aryl, 
alkylthio, ctl,2Man, ctl,6GalNAc, pl,3Galpl,4Glc, al,2Man-R 8 , <xl,6GalNAc-R 8 , and 
pl,3Gal-R 8 . R 8 may be H, C r C 8 alkyl, C r C 8 alkoxy, hydroxy-(C r C 8 alkyl), aryl-(C r C 8 
alkyl), (C r C 8 alkyl)-aryl, or alkylthio. In the formula, m and n are integers and may be either 
30 3 or 4; p may be zero or 1 . 

The substituted groups mentioned above may be substituted by hydroxy, 
hydroxy(C,-C 4 alkyl), polyhydroxy(C,-C 4 alkyl), alkanoamido, or hy<Jroxyalknoamido 
substituents. Preferred substituents include hydroxy, polyhydroxy(C 3 alkyl), acetamido and 
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hydroxyacetamido. A substituted radical may have more than one substitution, which may 

be the same or different. 

For embodiments in which R 1 is an oligosaccharide, the oligosaccharide is 

preferably a trisaccharide. Preferred trisaccharides include NeuAca2,3Galpl,4GlcNAcpl,3 

5 or NeuGcot2,3Galpl,4GlcNAcpl,3. 

For embodiments in which R 1 is the group having the formula: 
R 3 



10 




R 3 and R 4 preferably form a single radical having the formula: 

— R 5 — or — (R 6 ) q — O — (R 7 ) r — 
in which R 5 is C 3 -C 7 divalent alkyl, substituted or unsubstituted, R 6 and R 7 are the same or 

15 different and are C,-C 6 divalent alkyl, substituted or unsubstituted. In the formula, q and r 
are integers which may be the same or different and are either zero or 1 . The sum of <q and r 
is always at least 1 . 

A more preferred structure for a single radical formed by R 3 and R 4 is one 
having the formula: 

20 — (R 6 )— O— 

in which R 6 is C 3 -C 4 divalent alkyl, substituted or unsubstituted. For instance, R 6 may have 
the formula — CH 2 — CH 2 — CH 2 — CH 2 — , preferably substituted. The radical can be 
substituted with hydroxy, polyhydroxy(C 3 alkyl), and substituted or unsubstituted 
alkanoamido groups, such as acetamido or hydroxyacetamido. The substituted structure will 

25 typically form a monosaccharide, preferably a sialic acid such as NeuAc or NeuGc linked 
a2,3 to the Gal residue. 

In the general formula, above, both m and n are integers and can be either 3 or 
4. Thus, in one set of structures Gal is linked pi,4 and Fuc is linked a 1,3 to GlcNAc. This 
formula includes the SLe* tetrasaccharide. SLe x has the formula 

30 NeuAca2,3Gaipi,4(Fucal,3)GlcNAcpi — . This structure is selectively recognized by 
LECCAM-bearing cells. SLe x compounds that can be purified using the methods of the 
invention include NeuAca2,3Galpl,4(Fucal,3)GlcNAcpi-Gal-OEt, 
NeuAca2,3Galpl,4(Fucal,3)GlcNAcpi,4Gaipi-OEt, and others that are described in 
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international application WO 91/1 9502. Other compounds that one can purify using the 
methods include those described in US Patent No. 5,604,207 having the formula 







o 


R 



1 0 wherein Z is hydrogen, C r C 6 acyl or 





?Lr 5 


r4 orS 





15 

Y is selected from the group consisting of C(O), S0 2 , HNC(O), OC{0) and 

SC(O); 

R 1 is selected from the group consisting of an aryl, a substituted aryl and a 
phenyl C,-C 3 alkylene group, wherein said aryl substitutent is selected from the group 
20 consisting of a halo, trifuloromethyl, nitro, C r C 18 alkyl, C r C 18 alkoxy, amino, mono-C r C I8 
alkylamino, di-C r C 18 alkylamino, benzylamino, C r C 18 alkylbenzylamino, C,-C l8 thioaklyl 
and C r C l8 alkyl carboxamido groups, or 

R'Y is allyloxycarbonyl or chloroacetyl; 

R 2 is selected from the group consisting of monosaccharide (including 
25 pi ,3Gal-OR, where R=H, alkyl, aryl or acyl), disaccharide, hydrogen, C,-C 18 straight chain, 
branched chain or cyclic hydrocarbyl, C r C 6 alkyl, 3-(3,4,5-trimethoxyphenyl)propyl, C r C 5 
alkylene co-carboxylate, co-trisubstituted silyl C 2 -C 4 alkylene wherein said <o-trisubstituted 
silyl is a silyl group having three substituents independently selected from the group 
consisting of C r C 4 alkyl, phenyl, 
30 or OR 2 together form a C,-C 18 straight chain, branched chain or cyclic 

hydrocarbyl carbamate; 

R 3 is hydrogen or C,-C 6 acyl; 
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R 4 is hydrogen, C,-C 6 alkyl or benzyl; 

R 5 is selected from the group consisting of hydrogen, benzyl, methoxy benzyl, 
dimethoxybenzyl and C,-C 6 acyl; 

R 7 is methyl or hydroxymethyl; and 
5 X is selected from the group consisting of C,-C 6 acyloxy, C 2 -C 6 

hydroxylacyloxy, hydroxy, halo and azido. 

A related set of structures included in the general formula are those in which 
Gal is linked pi,3 and Fuc is linked al,4. For instance, the tetrasaccharide, 
NeuAca2,3Gaipi,3(Fucal,4)GlcNAcpi — , termed here SLe a , is recognized by selectin 
10 receptors. See, Berg et al, J. Biol Chem., 266:14869-14872 (1991). In particular, Berg et 
aL showed that cells transformed with E-selectin cDNA selectively bound neoglycoproteins 
comprising SLe a . 

The GDP-fucose produced by the methods of the invention can be used to 
modify oligosaccharides present on glycoconjugates, including glycoproteins and 

15 glycolipids. Proteins that can be modified by the methods of the invention include, for 

example, hormones such as insulin, growth hormones (including human growth hormone 
and bovine growth hormone), tissue-type plasminogen activator (t-PA), renin, clotting 
factors such as factor VIII and factor IX, bombesin, thrombin, hemopoietic growth factor, 
serum albumin, receptors for hormones or growth factors, interleukins, colony stimulating 

20 factors, T-cell receptors, MHC polypeptides, viral antigens, glycosyltransferases, and the 

like. Polypeptides of interest for recombinant expression and subsequent modification using 
the methods of the invention also include a 1 -antitrypsin, erythropoietin, granulocyte- 
macrophage colony stimulating factor, anti-thrombin III, interleukin 6, interferon p, protein 
C, fibrinogen, among many others. This list of polypeptides is exemplary, not exclusive. 

25 The methods are also useful for modifying the glycosylation patterns of chimeric proteins, 
including, but not limited to, chimeric proteins that include a moiety derived from an 
immunoglobulin, such as IgG. 

The following examples are offered to illustrate, but not to limit the present 

30 invention. 
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Example 1 

Cloning and Expression of Nucleic Acids Encoding GDP-Fucose Synthesizing Enzymes 

This Example describes the cloning and expression of nucleic acids that 
encode enzymes useful in the enzymatic conversion of GDP-mannose to GDP-fucose, and 
the use of these enzymes to produce GDP-fucose. 

Methods 

A. Amplification of GDP-mannose 4,6-dehydratase Open Reading Frame 

The GDP-mannose dehydratase (GMD) open reading frame was amplified by 
PCR using primers based on the known sequence for GMD (Stevenson et ah (1996) 7. 
Bacteriol 178: 4885-4893; Tonetti et al (1996) J. Biol Chem. 271: 27274-27279). Genomic 
DNA from E. coli K12 was purified using a kit from Qiagen, Inc. following the 
manufacturer's directions. To amplify the GMD open reading frame, the following primers 
were utilized: 5' primer: 5 ' -CGCTCT AG ATAC ATGTC AAAAGTCGCT-3 ' (SEQ ID NO: 
2); 3' primer: 5 ' -GCGAAGCTTTTATG ACTCC AGCGCG AT-3 * (SEQ ID NO: 3). The 
open reading frame for GDP-mannose dehydratase corresponds to nucleotides 8633-9754 of 
the wca gene cluster of E. coli K12 (Stevenson et al, supra,) (8659-9780 in GenBank 
Accession No. U38473). Each primer was designed with a 5' Xbal site and a 3' HindlU site. 
100 pmole of each primer pair was combined with genomic DNA from E. coli K12. Thirty 
cycles (95°C for 45 sec, 65°C for 1 min,73°C for 3 min) were run using Pfii polymerase. The 
products were gel purified, digested with ^V&tfl and HindlU and subcloned into these sites of 
the bacterial expression plasmid pTGK (PCT/US97/20528; Int'l. Publ. No. WO 98201 1 1). 
Sequencing of the plasmid revealed authentic GDP-mannose dehydratase with an ATG start 
site positioned appropriately for expression in pTGK. This resulted in the expression 
plasmid pTGK:GDP-mannose dehydratase (pTGK:GMD). 

B. Expression of GDP-mannose dehydratase E. coli JM101 

After transformation of pTGK.GMD into the E. coli strain JM101 using 
standard molecular biology protocols, isolated colonies were grown in M9 defined media 
containing galactose to induce the expression of the open reading frame. After an overnight 
incubation at 37°C with shaking, cells were pelleted and resuspended in 50mM Tris pH 8.0 
on ice (5 times the pellet volume). The cells were lysed at 4°C with a French press. The 
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cellular debris and unlysed cells were pelleted in a centrifuge (3000 rpm; Beckman floortop 
centrifuge) for 10 minutes. The resulting supernatant was stored on ice until analysis, up to 2 
hours, by SDS-PAGE and activity assay. 

G Assay of GDP-mannose Dehydratase 

To assay GDP-mannose dehydratase activity, a reverse phase HPLC assay 
was developed. An Aquasil CI 8 reverse phase column was set up in line on a Beckman 126 
solvent monitor. The column was eluted isocratically with 0.5M KH 2 P0 4 over twenty 
minutes with a flow rate of 1 ml/min. A photodiode array detector was employed to monitor 
the elution scanning the 210-300 nm wavelength range. GDP-mannose dehydratase activity 
was determined by reacting GDP-mannose dehydratase extract with 480 nmole of GDP- 
mannose in 50 mM Tris pH 7.5, 0.15 mM NADP* (NADP + is absolutely required for GDP- 
mannose dehydratase activity) in a 100 \x\ reaction volume. Twenty \i\ of the assay mix was 
injected over the CI 8 column. 

By comparing the peak area of the no enzyme control, corresponding to 200 
nmole, the decrease in the peak area of GDP-mannose was used to directly measure the 
activity of GDP-mannose dehydratase by the following formulae: 

1 . (No enzyme peak area) - (enzyme peak area) = (Peak area of GDP- 
mannose consumed) 

2. (Peak area of GDP-mannose consumed)/(No enzyme peak area) = % of 
200 nmole GDP- mannose consumed. 

To control for any carryover loss upon HPLC injection, 10 nmole of CTP was 
added to the sample to be analyzed. 

Z). Amplification of YEF B, human Fx and wca H Open Reading Frames 
Nucleic acids that include the open reading frames of YEF B, wcaH, and 
human Fx were amplified using primers that were synthesized based on the reported 
nucleotide sequences (Stevenson et ah (1996) supra. ; Tonetti et al. (1996) supra.). Genomic 
DNA from E. coli K12 was purified using a kit from Qiagen, Inc. following the 
manufacturer's directions. To amplify these open reading frames the following primers were 
utilized: 
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YEFB 

5' primer: 5 '-CGTCCTAGAGCGATGAGTAAACAACGAGTT-3 ' (SEQ ID NO: 4); 
3' primer: 5 '-GCG AAGCTTTTACCCCCG AAAGCGGTC-3 ' (SEQ ID NO: 5); 

wca H 

5' primer: 5'-GCTCTAGAGTAATGATGTTTTTACGTCAGG-3' (SEQ ID NO: 6); 
3' primer 5 '-CCC AAGCTTTCATAATCCGGGTACTCCGGT-3 ' (SEQ ID NO: 7); and 

human Fx 

5' primer: 5'-GCTCTAGAGACATGGGTGAACCCCAGGGAT-3' (SEQ ID NO: 8); 
3' primer: 5 ' - ACG AAGCTTC ACTTCCGGGCCTGCTCGTAGTTG-3 ' (SEQ ID NO: 9). 

YEF B and wcaH correspond to nucleotides 9757-10722 (9783-10748 in 
GenBank Accession No. U38473) and 10722-1 1204 (10748-1 1230 in GenBank Accession 
No. U38473), respectively, of the wca gene cluster of E. coli K12 (Stevenson et ah, supra.). 

Each primer was designed with a 5' Xbal site and a 3' Hindlll site. One 
hundred pmole of each primer pair was combined with genomic DNA from E. coli K12 or, 
in the case of human Fx, cDNA from human placenta. Thirty cycles (95 °C for 45 sec, 65°C 
for 1 min,73°C for 3 min) were run with Pfu polymerase. The products were gel purified, 
digested with Xbal and Hindlll and subcloned into these sites of the bacterial expression 
plasmid pTGK (PCT/US97/20528; Int'l. Publ. No. WO 98201 1 1). This resulted in the 
following expression plasmids: pTGK: YEF B, pTGK:human Fx (pTGK:Fx) and 
pTGK.wcaH. 

E. Expression of YEFB (xvcaG), Human Fx and wcatf in E. coli JM101 

After transformation of pTGK:YEF B, pTGK.Fx and pTGK:wcaH into the E. 
coli strain JM101 using standard molecular biology protocols, isolated colonies were grown 
in M9 defined media containing galactose to induce the expression of the open reading 
frames. After an overnight incubation at 37°C with shaking, cells were pelleted and 
resuspended in 50mM Tris pH 8.0 on ice (5 times the pellet volume). The cells were lysed at 
4°C with a French press. The cellular debris and unlysed cells were pelleted in a centrifuge 
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(3000 rpm Beckman floortop) for 10 minutes. The resulting supernatant was stored on ice 
until analysis, up to 2 hours, by SDS-PAGE and activity assay. 

F. Activity assay for YEF B and human Fx 

To assay YEF B activity, a reverse phase HPLC assay was developed. An 
5 Aquesil CI 8 reverse phase column was set up in-line on a Beckman 126 solvent monitor. 
The column was eluted isocratically with 0.5M KH 2 P0 4 over twenty minutes with a flow 
rate of 1 ml/min. A photodiode array detector was employed to monitor the elution scanning 
the 210-300 nm wavelength range. 

A coupled assay was employed to assay YEF B activity. This was required 

10 since its substrate, GDP-4-keto-6-deoxymannose, is unstable and not commercially 

available. Twenty mUnit of GDP-mannose dehydratase was allowed to react with 480 nMole 
of GDP-mannose for 20 minutes under standard assay conditions as previously. At this time 
greater than 90% of the GDP-mannose has been converted to GDP-4-keto-6-deoxymannose, 
the substrate for YEF B. The following additions were then made: 500 nmole glucose, 

1 5 MgCl 2 to 20 mM, NADPH to 0. 1 5 mM, 1 0 mUnit glucose dehydrogenase, 1 00 nmole CTP 
(to normalize for losses that may occur when sample is injected onto HPLC), YEF B and 
water to 50 \x\. The reaction was allowed to proceed for ten minutes at which point the 
reaction was stopped by freezing on dry ice. After thawing 10 aliquots were analyzed by 
reverse phase HPLC analysis. The amount of GDP fucose formed was calculated by 

20 analyzing a separate assay mix in the absence of GDP-mannose dehydratase to yield a 100% 
value for the absorption peak of GDP-mannose. This peak area was then divided by the peak 
area of GDP-fucose formed to yield percentage of GDP-fucose formed. (% GDP- 
fucose)(nMole GDP-mannose in reaction) = nMole GDP-fucose formed. 

G. Characterization of the sugar nucleotide formed by GDP-mannose 
25 dehydratase and YEFB 

To confirm that GDP-mannose dehydratase and YEF B convert GDP- 
mannose into GDP-fucose, GDP-mannose dehydratase and YEF B extracts were reacted 
with 5 mg of GDP-mannose and 6 mg of NADPH in 50 mM Tris pH 8.0. The reaction 
volume was 1 ml. After 3 hours, 95% of the GDP-mannose was converted into an adsorption 
30 peak that migrated with GDP-fucose. The reaction was quenched with 2 ml methanol, the 
precipitate was pelleted and the supernatant was dried under vacuum. The resulting sugar 
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nucleotide was then reacted with sialyl N-acetyllactosamine (SLN; a precursor to sialyl 
Lewis"), fucosyltransferase V (20 mUnit), 1 mM MnCl 2 in 50 mM Tris pH 7.5 for 24 hours 
at 37°C. The reaction volume was 200 The reaction was approximately 80-85% complete 
as judged by TLC. The oligosaccharide product was resolved from nucleotides and protein 
by purification over a 1 5 ml P2 column eluted with 20% ethanol and monitoring the elution 
with a photodiode array detector. The oligosaccharide product was then dried under vacuum, 
protons exchanged with D 2 0 and subjected to 500 Mhz proton NMR. This analysis revealed 
that authentic sialyl Lewis x -containing compound was formed, demonstrating that GDP- 
mannose dehydratase and YEF B act in concert to form GDP-fucose. 

H. Synthesis of GDP-fucose at 0.2 kG scale 

800 grams of a bacterial cell pellet expressing GDP-mannose dehydratase 
(GMD) was resuspended in 4 liters of ice cold 50 mM Tris pH 7.5. The cells were disrupted 
by one passage through a microfluidizer, debris was pelleted by centrifugation at 4,000 rpm 
for 15 minutes and the supernatant was separated from the pellet by decanting. 

200 grams of GDP-mannose, 87% pure by HPLC, was obtained from 
Boehringer Mannheim and dissolved in 1 liter of 50 mM Tris pH 7.5. To this solution, 4 
liters of the GMD extract (2000 Unit) was added along with NADP + to a final concentration 
of 0.15 mM. The final concentration of GDP-mannose was 66 mM. Sodium azide was added 
to 0.05% to inhibit bacterial growth. The reaction was incubated at 32°C with mild shaking 
and the reaction followed by HPLC. After 5 hours, 98% of the GDP-mannose had been 
converted into its intermediate, GDP-4-keto-6-deoxymannose, at which time a YEF B 
supernatant from a 300 gram bacterial cell pellet expressing YEF B derived identically to 
GMD supernatant (1 500 Unit), was added to the reaction mixture along with 20mg NADPH, 
0.66 mole of glucose (2 molar equivalents to GDP-mannose), and 1 800 Unit glucose 
dehydrogenase. Incubation at 32°C was continued overnight with mild continuous shaking. 
The following morning, 100% of the intermediate, GDP-4-keto-6-deoxymannose, had been 
converted to GDP-fucose as determined by HPLC. This material was passed through a 10 kD 
MWC tangential flow membrane to remove high molecular weight material including 
protein and bacterial polysaccharides. The effluent containing GDP-fucose was then 
precipitated by addition of ethanol to 80%. After storage at 4°C, the supernatant was 
decanted and the precipitated GDP-fucose was pelleted in a centrifuge at 4500 rpm for 20 
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minutes. The supernatant was removed and the pellets resuspended in water and lyophilized. 
From 200 grams of GDP-mannose, 140 grams of GDP-fucose was obtained for a molar yield 
of 72%. The purity of the GDP-fucose was similar to that of GDP-mannose, approximately 
87%. 

5 /. Synthesis of Sialyl Lewis X Antigen at the 100 gram scale Using Enzymatic 

Fucosylation 

Approximately 128 grams or 0.15 mole of SLN (the tetrasaccharide precursor 
of sialyl Lewis X) was added to 1 10 grams or 0.18 mole of GDP-fucose and resuspended in 
2.5 liters of water. 5m Tris pH 7.5 was added to a final concentration of 50 mM and MgCl 2 

1 0 was added to a final concentration of 20 mM. 440 ml of fucosyltransferase V (FT V) at 1 .24 
Unit/ml purified from A. niger broth was added along with 600 units of alkaline phosphatase 
from Boehringer Mannheim. Sodium azide was added to 0.04%. The final volume was 3 
liters. The reaction was monitored by TLC, HPLC to determine the level of GDP-fucose, 
GDP and MgCl 2 . The pH of the reaction was monitored by pH meter. The solvent system 

1 5 used for TLC analysis was isopropanol : water 4:1. After the reaction was judged to be 
complete by TLC, sialyl Lewis X was purified using ultrafiltration. 

J. Synthesis of Sialyl Lewis X Antigen at 1 mg Scale Using an Enzymatic Half 
Cycle 

The half cycle was run using the following conditions (all concentrations are 
20 final): SLN 5mM; GDP-mannose, lOmM; Glucose, 18 mM; MgCl 2 , 20 mM; Tris pH 7.5, 50 
mM; NADP + , 0.12 mM; GDP-mannose dehydratase, 1.16 Unit/ml; YEF B 740mUnit/ml; 
glucose dehydrogenase, 330mUnit/ml; FT V, 46mUnit/ml; pyruvate kinase, 16.5Unit/ml; 
sodium azide, 0.04%. The final volume was 220^1. The reaction was monitored by TLC and 
HPLC. 

25 Results 

A. Characterization of GDP-mannose Dehydratase 
1. Expression of GDP-mannose Dehydratase 

The expression of GDP-mannose dehydratase was analyzed by SDS-PAGE. 
The results are shown in Figure 2, lane 4. A large band is present at 42 kD, the predicted 
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molecular weight; this band is not present in the vector alone control (lane 1) or the cells 
expressing YEF B (lanes 2 and 3). 

2. Characterization of GDP-mannose dehydratase 

To characterize the enzymatic activity of GDP-mannose dehydratase, a linear 
HPLC assay was developed. The assay, which is based on hydrophobic differences between 
GDP-mannose, GDP-4-keto-6-deoxymannose and GDP-fucose, is capable of resolving 
GDP-mannose, GDP-4-keto-6-deoxy-mannose, GDP-4 keto-6-deoxy-galactose and GDP- 
fucose. Relevant chromatograms are displayed in Figure 3. The top chromatogram displays 
GDP-mannose after one minute in the presence of GDP-mannose dehydratase, and the 
bottom chromatogram displays the same reaction after 55 minutes. As indicated by the 
chromatograms, greater than 98% of the GDP-mannose has been converted to GDP-4-keto- 
6-deoxymannose. Proof that this product is GDP-4-keto-6-deoxymannose is demonstrated by 
addition of NADPH and YEF B or Fx protein converting the product to GDP fucose. The 
salient features of this assay are: 1) concomitant with a decrease in GDP-mannose there is an 
increase in the intermediates formed by GDP-mannose 4,6 dehydratase that are resolved by 
the column, serving as an internal control to demonstrate that the decrease in GDP-mannose 
is not due to a hydrolytic activity, and 2) quantitative reproducibility. 

3. Linearity of the GDP-mannose dehydratase assay with time 

To demonstrate the linearity of the GDP-mannose dehydratase assay with 
time, the standard assay was performed in a single eppendorf tube and 10 |il aliquots were 
removed at times ranging from 0-30 minutes and immediately frozen on dry ice prior to 
reverse phase HPLC analysis. Samples were thawed and immediately injected over an 
aquasil reverse phase HPLC column. As displayed in Figure 4, the assay is linear up to 30 
minutes. Surprisingly, the assay remained linear even when up to 90% of the substrate was 
consumed. This indicates that the GDP-mannose concentration remains above the Km of 
GDP-mannose dehydratase and that GDP-4-keto-6 deoxymannose does not inhibit the 
enzyme. 

4. pH Optimum 

To determine the pH optimum of the enzyme, pH was varied from pH 6.5-8.0 
using 50 mM MES or Tris as buffers and the standard assay performed. Table 4 displays the 
pH profile of GDP-mannose dehydratase and illustrates a pH optimum of 7.0. Below pH 7.0, 
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the activity is about 70% that of pH 7.0, while pH's above 7.0 yields only a moderate 
decrease in activity. 



Table 4: pH Profile of GMD 



l P H 


Activity (Unit/ml) 


6.5 


3.1 


7.0 


4.7 


7.5 


4.5 


8.0 


4.0 



5 Since GDP-mannose dehydratase will be used in conjunction with the 



epimerase/reductase and fucosyltransferase which both have pH optima of 7.5, pH 7.5 was 
chosen as the working pH for the standard assay. 

5. Dependence on NADP 

When assayed in the absence of NADP + , GDP-mannose dehydratase is 
1 0 completely inactive. Additionally, NAD + will not substitute for NADP\ Moreover, if 

alkaline phosphatase is included in the assay, GDP-mannose dehydratase activity is severely 
attenuated. This data is displayed in Table 5 below. 



Table 5: NADP" Dependence of GMD 



JNADPl 


% GDP-mannose dehydratase activity 


0.15 mM 


100 


1 mM 


100 


None 


not detected 


10 mUnit Alkaline Phosphatase 


not detected 


1 mM NAD 


not detected 



15 

6. GDP-mannose dehydratase is not dependent on divalent cations 
When GDP-mannose dehydratase was assayed under standard assay 
conditions in the presence or absence of MgCl 2 or MnCl 2 at concentrations ranging from 1- 
20 mM, essentially no effect was observed on activity. These results are illustrated in Table 
20 6. 
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Table 6: Dependence of GMD on Divalent Metal Cations 



[Metal Ion] % GDP-mannose Dehydratase Activity 

mUnit/ml 

1 mM MnCl 2 580 
10mMMnCl 2 570 
20 mM MnCl 2 620 
1 mM MgCl 2 560 
10mMMgCl 2 610 
20 mM MgCI 2 590 



7. Inhibition of GDP-mannose Dehydratase by GDP-fucose 
Since GDP-mannose dehydratase represents the first committed step in GDP- 
5 fucose biosynthesis it is possible that GDP-fucose is a feedback inhibitor of the enzyme. To 
determine if this is true, the standard assay was performed in the presence of increasing 
amounts of GDP-fucose. Figure 5 displays the inhibition curve. It is clear that GDP-fucose is 
a potent inhibitor of this enzyme with an IC 50 of approximately 40)iM. This is very important 
from the standpoint of synthesizing fucosylated oligosaccharides since the synthetic process 
10 will require that 1) GDP-fucose is removed from the reaction mixture as would occur with 
the addition of fucose to an oligosaccharide chain via fucosyltransferase; or 2) that all GDP- 
mannose is converted to its intermediate, GDP-4-keto-6-deoxymannose prior to conversion 
- into GDP-fucose. 

B. Characterization of YEFB, wca/f, and human Fx Expressed in E. coli 

15 1. YEF B Expression 

To test for expression, bacteria harboring these plasmids were grown, lysed 
and analyzed by SDS-PAGE (Figure 2). These lysates were compared with bacterial lysates 
harboring the pTGK vector alone (Lane 1). Lysates harboring two separate YEF B clones 
(Lanes 2 and 3) display highly expressed proteins at 36.5 Kd that are absent in lysates from 

20 bacteria harboring the vector alone (Lane 1 ). The predicted molecular weight of YEF B is 
36.5 Kd (Stevenson et al. (1996) J. Bacterid 178: 4885-4893; Tonetti et aL, supra.). 

2. wcaH and Human Fx Expression 

The SDS-PAGE shown in Figure 6 displays the expression of wcaU and 
human Fx in E. coli from the expression vectors pTGKrwcaH and pTGK:human Fx 
25 (pTGK:Fx), respectively. A large band was observed for wcaU. Somewhat unexpectedly, the 
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human Fx protein was also expressed at significant levels in bacteria using this expression 
system. 

3. Assay of YEF B and human Fx Activity 

YEF B and human Fx were assayed using an HPLC method that resolves 
5 GDP-mannose from GDP-4-keto-6-deoxymannose, GDP-4-keto-6-deoxyglucose and GDP- 
fiicose based on hydrophobicity as described in methods. Figure 7A shows the HPLC 
analysis 55 minutes after addition of GDP-mannose dehydratase. Essentially all the GDP- 
mannose has been consumed and converted into intermediate products. Upon addition of 
YEF B and NADPH to this assay mix, a peak with the same absorption spectrum as GDP- 
10 fiicose and co-migrates with GDP-fucose (Figure 7B). If an irrelevant extract is used in place 
of YEF B extract (extract from JM101 cells harboring the pTGK plasmid only) the GDP- 
fucose absorption peak does not appear. 

This indicates that GDP-mannose dehydratase works in conjunction with 
YEF B to form GDP-fucose. Therefore YEF B, similar to its human homologue, Fx, is both 
15 an epimerase and a reductase. Identical results are obtained if YEF B is purified by Cibacron 
Blue chromatography, providing further evidence that YEF B encodes both epimerase and 
reductase activities in a single enzyme. 

Human Fx protein also displayed significant activity in this assay, although at 
levels approximately 10 fold less than YEF B ? This mirrors the expression level deduced 
20 from SDS-PAGE gels. 

4. pH optimum 

The assay pH was varied from 6.5 to 8.0 using MES and Tris as buffers. The 
standard assay mix contained 20 mUnit GDP-mannose dehydratase, 480 nmole GDP- 
mannose, 500 nmole glucose, 20 mM MgCl 2 , 0.2 mM NADPH, 10 mUnit glucose 
25 dehydrogenase (to recycle NADPH utilized by YEF B), 50 mM buffer at the appropriate pH 
and YEF B at appropriate dilution. The reaction was allowed to proceed for 1 5 minutes at 
37°C and immediately frozen on dry ice to stop the reaction. Ten jal aliquots were removed 
after thawing and analyzed by reverse phase HPLC. As shown in Table 7, YEF B has a pH 
optimum of 7.5. 
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Table 7: pH Optimum of YEF B 



pH 


Activity (Unit/ml) 


6.5 


.83 


7.0 


.91 


7.5 


1.2 


8.0 


.41 



5. Metal Ion Requirement 

To study the metal ion requirement, standard assay conditions were employed 
5 with the addition of varying concentrations of MgCl 2 and MnCl 2 . These two divalent cations 
were chosen since YEF B can be utilized in a fucose cycle and fucosyltransferase V requires 
either MgCl 2 or MnCl 2 for activity. Table 8 displays the results, which show that YEF B is 
severely inhibited in a dose dependent manner by MnCl 2 at concentrations required for 
fucosyltransferase V activity. In contrast MgCl 2 was actually somewhat stimulatory. Thus, 
1 0 the fucose "half cycles" are preferably run in the presence of MgCl 2 but not MnCl 2 . 



Table 8: Analysis of Metal Ion Effect on YEF B Activity 



[Metal Ion] 


Activity % 


none 


100 


1 mM MnCl 2 


60 


lOraMMnQj 


25 


1 mM MgCl 2 


110 


10mMMgCl 2 


110 



6. Linearity over Time 

1 5 To determine the linearity of the coupled YEF B assay over time, standard 

assay conditions were employed. Ten jal aliquots were removed from the same vial over 30 
minutes. A separate tube containing no GDP-mannose dehydratase was used as the 1 00% 
control. Figure 8 displays the results of this assay with respect to time. Although the assay is 
not linear with respect to time, it does provide an excellent approximation of activity. Likely 

20 explanations for the lack of linearity include product inhibition of the epimerization step or 
reduction step, or both. Regardless, utilizing this assay indicates that a twenty liter 
fermentation yields 600-700 unit/liter; sufficient for multi kilogram synthesis of sialyl Lewis 
X antigen or other fucosylated oligosaccharides of interest. 
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7. Production of GDP-fucose from GDP-mannose on a 0,2 kG scale 

To demonstrate that GDP-mannose dehydratase and YEF B can be used to 
produce GDP-fucose at scale, 200 grams of GDP-mannose was converted into GDP-fucose. 
This was accomplished using 2000 unit of GDP-mannose dehydratase and 2000 unit of YEF 
5 B. To achieve complete conversion of GDP-mannose into its intermediate, GDP-4-keto-6- 
deoxymannose, YEF B and glucose dehydrogenase must be added after all GDP-mannose 
has been converted to GDP-4-keto-6-deoxymanriose. This is because GDP-fucose, the 
product of YEF B, is a 40^iM inhibitor of GDP-mannose dehydrogenase (see Figure 5). 
Since the GDP-mannose concentration in the reaction mixture is approximately 65 mM, once 

1 0 YEF B is added, along with NADPH and glucose dehydrogenase to recycle NADPH, the 

conversion of GDP-mannose into its intermediate by GDP-mannose dehydrogenase is halted. 
Thus the reaction is preferably closely monitored by HPLC to track the amount of GDP- 
mannose consumed and the amount of GDP-4-keto-6-deoxymannose generated. 

Once the GDP-mannose has been completely converted into its intermediate, 

1 5 2000 unit of YEF B, 900 unit glucose dehydrogenase, 1 gram NADPH and 1 .2 molar 

equivalents of glucose (relative to GDP-mannose) was added to the reaction mixture and 
incubated overnight at 35°C. The mixture was processed by passing the mixture through a 
1 OkD M WC tangential flow filtration device to remove the majority of protein and 
polysaccharide. The GDP-fiicose was purified and concentrated by ultrafiltration followed 

20 by precipitation by addition of ethanol to 80%. The precipitation step effectively removed 
free monosaccharides (glucose and gluconate) that were not removed by the ultrafiltration 
step. On a moleimole basis this equates to an approximate 65% yield. HPLC analysis was 
used to confirm that the reaction was complete (Figure 9). 

8. Fucosvlation of Sialyl Lewis X Antigen Precursor at the 1 00 Gram Scale 
25 To demonstrate the utility of bacterial GDP-mannose dehydratase and YEF B 

for the large scale synthesis of sialyl Lewis X antigen and other bioactive carbohydrates, a 
100 gram reaction was conducted in which Neu5Aca2,3Gaipi,4GlcNAcpl,3Gal-OR (SLN), 
the tetrasaccharide precursor of sialyl Lewis X antigen, was fucosylated to form sialyl Lewis 
X antigen. Conversion of SLN into sialyl Lewis X antigen was monitored by HPLC, TLC 
30 and pH by pH electrode. The results of this assay are shown in Figure 10, in which formation 
of sialyl Lewis X is monitored by detecting the increase in free GDP concentration. The pH 
of the reaction mixture is also shown. As GDP was hydrolyzed by alkaline phosphatase 
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liberating inorganic phosphate, the pH of the reaction decreased. The liberated inorganic 
phosphate also formed a precipitate with magnesium ions, decreasing the concentration of 
free magnesium, as shown in Figure 11. 

The decrease in pH and magnesium ions dramatically slows the reaction for 

5 " two reasons: 1) as the pH decreases, the activity of alkaline phosphatase is severely inhibited 
causing GDP to buildup to levels that inhibit the fucosyltransferase; and 2) 
fucosyltransferase V requires magnesium at a concentration of 20 mM for optimal activity. 
Thus as the magnesium concentration decreases, the fucosyltransferase activity also 
decreases. Therefore, the reaction pH and magnesium concentrations are preferably 

1 0 monitored and the reaction supplemented with MgCl 2 and the pH maintained at 8.0 by 

addition of NaOH. Since calf intestinal phosphatase has a pH optimum of 9.5 and has only 
10% of its activity at pH 8.0, a neutral phosphatase (one with a neutral pH activity) is 
optimal for this reaction. 

It is clear from Figure 12 that the reaction proceeds very rapidly initially, and 

1 5 then slows dramatically. This was due to the buildup of GDP and decrease in magnesium as 
discussed above. Once the pH was brought up to 8.0 and maintained at this level, and the 
reaction was supplemented with alkaline phosphatase and magnesium (which was done at 50 
hours in this reaction) the reaction proceeds to greater than 95% completion with 15 grams 
of GDP-fucose added to supplement the reaction. 

20 9. NMR Analysis of Final Sialyl Lewis X Reaction Product at 100 Gram 

Scale 

The final reaction product was purified by ultrafiltration and subjected to 
NMR analysis to confirm the addition of fucbse to Neu5Aca2,3Gaipi,4GlcNAc{51,3Gal- 
OR. Figure 13 shows the 500 Mhz proton NMR spectra and the pertinent chemical shifts for 
25 fucose. The spectrum is identical to authentic sialyl Lewis X antigen and confirms that the 
concerted action of GDP-mannose dehydratase and YEF B forms GDP-fucose. This donor 
substrate can then be utilized by fiicosyltransferases to synthesize fucosylated 
oligosaccharides; in this example sialyl Lewis X antigen. 

10. Conversion of SLN to Sialyl Lewis X Antigen Using an Enzymatic Half 
30 Cycle 

Previous experiments demonstrated that alkaline phosphatase inactivated 
GDP-mannose dehydratase by cleavage of its cofactor (NADP) phosphate group (See Table 
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5). Thus alkaline phosphatase could not be utilized to hydrolyze GDP to its free nucleotide. 
Removal of GDP is required for this reaction to proceed to completion since GDP is a potent 
inhibitor of fucosyltransferases (Shinoda et al (1997) J. Biol Chem. 272: 31992-31997). To 
accomplish the removal of GDP, conversion of GDP to GTP was tried. GTP is a 1 Ox less 
5 potent inhibitor of fucosyltransferase V than GDP. To convert GDP to GTP, PEP and 
pyruvate kinase were employed in the reaction mixture. The schematic for this GDP- 
mannose half-cycle fucosyltransferase reaction is shown in Figure 14. As shown by the TLC 
in Figure 15, the reaction consisting of 5 mM SLN and 10 mM GDP-mannose proceeds to 
approximately 95% completion after 2 days. A 10 fold molar excess of PEP to GDP- 

1 0 mannose was required for completion of this reaction. 

Two important results emerge from this experiment: 1) It is possible for the 
GDP-mannose half cycle to proceed to near completion if GDP is converted to GTP; and 2) 
strong evidence is provided that a complete cycle starting from mannose with the addition of 
hexokinase, phosphomannomutase and GDP-mannose pyrophosphorylase, would result in an 

1 5 efficient reaction. See Figure 15 for an illustration of the full fucosyltransferase cycle, in 
which GDP-fucose is synthesized from mannose. 

It is understood that the examples and embodiments described herein are for 
illustrative purposes only and that various modifications or changes in light thereof will be 
20 suggested to persons skilled in the art and are to be included within the spirit and purview of 
this application and scope of the appended claims. All publications, patents, and patent 
applications cited herein are hereby incorporated by reference for all purposes. 
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WHAT IS CLAIMED IS: 



1 1 . An expression vector that comprises a promoter operably linked to a 

2 nucleic acid that encodes a prokaryotic enzyme that has both an epimerase and a reductase 

3 activity that catalyze the conversion of GDP-4-keto-6-deoxymannose to GDP-fucose, 

4 wherein the expression vector lacks an E. coli weal coding region. 

1 2. The expression vector of claim 1 , wherein the nucleic acid encodes a 

2 polypeptide that is substantially identical to an E. coli YEF B polypeptide. 

1 3. The expression vector of claim 2 ? wherein the nucleic acid encodes a 

2 polypeptide that comprises an amino acid sequence of SEQ ID NO: 1 . 

1 4. The expression vector of claim 3, wherein the polypeptide is an E. coli 

2 YEF B polypeptide. 

1 5. The expression vector of claim 1 , wherein the nucleic acid is 

2 substantially identical to a YEF B-encoding nucleic acid. 

1 6. The expression vector of claim 1 , wherein the expression vector lacks an 

2 E. coli wcaU coding region. 

1 7. The expression vector of claim 1 9 wherein the promoter is a 

2 heterologous promoter. 

1 8. The expression vector of claim 7, wherein the promoter is an inducible 

2 promoter. 

1 9. The expression vector of claim 7, wherein the promoter is a dual tac-gal 

2 promoter. 



1 



1 0. A cell which comprises the expression vector of claim 1 . 
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] 11. The cell of claim 1 0 5 wherein the cell is a prokaryotic cell. 

1 1 2. The cell of claim 1 0, wherein the cell is permeabilized. 

1 1 3. A reaction mixture for synthesizing GDP-fucose, the reaction mixture 

2 comprising GDP-4-keto-6-deoxymannose, NADPH, and a prokaryotic enzyme that has both 

3 an epimerase and a reductase activity, wherein the prokaryotic enzyme can catalyze the 

4 conversion of GDP-4-keto-6-deoxymannose to GDP-fucose. 

1 14. The reaction mixture of claim 13, wherein the prokaryotic enzyme is 

2 substantially identical to E. coli YEF B. 

1 15. The reaction mixture of claim 13, wherein the GDP-4-keto-6- 

2 deoxymannose is formed by: 

3 providing a reaction mixture that comprises GDP-mannose, GDP- 

4 mannose-4,6-dehydratase, and NADP + ; and 

5 incubating the reaction mixture for a sufficient time to convert at least 

6 about 90% of the GDP-mannose to GDP-4-keto-6-deoxymannose. 

1 1 6. The reaction mixture of claim 13, wherein the prokaryotic enzyme is 

2 provided as a lysate of cells that comprise an expression vector that comprises a promoter 

3 operably linked to a nucleic acid that encodes the prokaryotic enzyme. 

1 17. A method for enzymatic conversion of GDP-mannose to GDP-fucose, 

2 the method comprising: 

3 a) providing a reaction mixture that comprises GDP-mannose, GDP- 

4 mannose 4,6-dehydratase, and NADP + ; 

5 b) incubating the reaction mixture for a sufficient time to convert at 

6 least about 90% of the GDP-mannose to GDP-4-keto-6-deoxymannose; 

7 c) adding to the reaction mixture one or more polypeptides having 

8 GDP-4-keto-6-deoxymannose 3,5-epimerase and GDP-4-keto-6-galactose reductase 

9 activities; and 
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I o d) incubating the reaction mixture for a sufficient time to convert the 

I I GDP-4-keto-6-deoxymannose to GDP-fucose. 

1 18. The method of claim 1 7, which method further comprises recycling 

2 NADP* or NAD + produced by the reductase activity to NADPH or NADH, respectively, by 

3 including in the reaction mixture of step c) an enzyme that can reduce the NADP + or NAD + , 

4 and a substrate for the enzyme. 

1 1 9. The method of claim 1 8, wherein the enzyme is selected from the group 

2 consisting of alcohol dehydrogenase, glucose dehydrogenase, formate dehydrogenase, 

3 hydrogenase, and glucose-6-phosphate dehydrogenase. 

1 20. The method of claim 19, wherein the enzyme is glucose dehydrogenase 

2 and the substrate is glucose. 



1 21 . The method of claim 1 7, which method further comprises transferring a 

2 fucose from the GDP-fucose to an acceptor saccharide by: 

3 e) adding a fucosyltransferase and the acceptor saccharide to the GDP- 

4 4-keto-6-deoxymannose produced in step b) or to the GDP-fucose produced in step d); and 

5 f) incubating a reaction mixture for a sufficient time to transfer the 

6 fucose from the GDP-fucose to the acceptor saccharide. 

1 22. The method of claim 21 , wherein the steps are each conducted in the 

2 same reaction vessel. 

1 23 . The method of claim 2 1 , wherein the fucosyltransferase and the acceptor 

2 saccharide is added to the product of step d) after substantially all of the GDP-4-keto-6- 

3 deoxymannose is converted to GDP-fucose. 



1 24. The method of claim 23, wherein the GDP-fucose is purified prior to 

2 adding the fucosyltransferase and the acceptor saccharide. 
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1 25. The method of claim 21 , wherein the fiicosyltransferase and the acceptor 

2 saccharide are added to the GDP-4-keto-6-deoxymannose produced in step b) approximately 

3 simultaneously with the performance of step c). 

1 26. The method of claim 21, wherein method further comprises adding a 

2 phosphatase to the reaction mixture of step e), wherein the phosphatase cleaves a phosphate 

3 from GDP but does not cleave a phosphate from NADPH. 

1 27. The method of claim 21 , wherein the method further comprises adding a 

2 kinase and a kinase substrate to the reaction mixture of step e), wherein GDP produced as a 

3 result of the transfer of fucose from the GDP-fucose is converted to GTP. 

1 28. The method of claim 27, wherein the kinase is selected from the group 

2 consisting of pyruvate kinase, polyphosphate kinase, creatine kinase, and acetyl kinase. 

1 29. The method of claim 28, wherein the kinase is pyruvate kinase and the 

2 kinase substrate is phosphenolpyruvate. 

1 30. The method of claim 21 , wherein the acceptor saccharide is selected 

2 from the group consisting of Galp(l-4)Glc-R, Galp(l-3)Glc-R, Gaip(l-4)GlcN(R')P-R and 

3 Gaip(l -3)GlcN(R')P-R, wherein 

4 R is selected from the group consisting of hydrogen, a saccharide, an 

5 oligosaccharide and an aglycon group having at least one carbon atom; and 

6 R 1 is selected from the group consisting of acetyl and allyloxycarbonyl. 

1 31. The method of claim 30, wherein the acceptor saccharide is selected 

2 from the group consisting of NeuAca(2->3)Gaip(l->4)GlcN<R')P(l-»3)Gal^-OR and 

3 NeuAca(2->3)Gaip(l ^3)GlcN(R f )P(l ->3)Galp-OR. 

1 32. The method of claim 3 1 , wherein the acceptor saccharide is selected 

2 from the group consisting of NeuAca(2-»3)Gaip(l ->4)GlcNAcP(l ->3)Gaip-OR and 

3 NeuAca(2-^3)Gaip(l->3)GlcNAcp(l-»»3)Gaip-OR. 
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1 33. The method of claim 30, wherein the fiicosyltransferase is selected from 

2 the group consisting of: an al,3/4 fiicosyltransferase (Fuc -T III), an al,3 fiicosyltransferase, 

3 a bacterial otl ,3 fiicosyltransferase, an <xl,2 fiicosyltransferase. 

1 34. The method of claim 33, wherein the ctl,3 fiicosyltransferase is selected 

2 from the group consisting of Fuc -T IV, Fuc -T V, Fuc -T VI, and Fuc-T VII. 

1 35. The method of claim 33, wherein the al,2 fiicosyltransferase is selected 

2 from the group consisting of Fuc-T I, and Fuc -T II. 

1 36. The method of claim 1 7, wherein the method further comprises an 

2 enzymatic system for generating the GDP-mannose from mannose. 

1 37. The method of claim 36, wherein the enzymatic system for generating 

2 the GDP-mannose from mannose comprises: 

3 hexokinase, which converts mannose to rnannose-6-phosphate; 

4 phosphomannomutase, which converts the mannose-6-phosphate to 

5 mannose- 1 -phosphate; and 

6 GDP-mannose pyrophosphorylase, which converts the mannose- 1- 

7 phosphate to GDP-mannose. 

1 38. The method of claim 1 7, wherein a single polypeptide provides both the 

2 GDP-4-keto-6-deoxymannose 3,5-epimerase activity and the GDP-4-keto-6-galactose 

3 reductase activity. 

1 39. The method of claim 38, wherein the polypeptide is a prokaryotic 

2 enzyme. 

1 40. The method of claim 39, wherein the polypeptide is substantially 

2 identical to an E. coli YEF B polypeptide. 
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1 41 . The method of claim 40, wherein the polypeptide that comprises an 

2 amino acid sequence of SEQ ID NO: 1 . 



1 



42. The method of claim 39, wherein the prokaryotic enzyme is from E. 



2 



coli. 



1 



43. The method of claim 38, wherein the polypeptide is an FX polypeptide 



2 from a eukaryote. 



1 



44. The method of claim 1 7, wherein the method further comprises adding 



2 sufficient divalent metal cation to said reaction medium to restore a portion of said divalent 

3 cation lost during the course of the reaction to thereby achieve or maintain a concentration of 

4 said divalent metal cation in said reaction medium between about 1 mM and about 75 mM, 

5 and wherein the addition of divalent metal cation occurs without interruption of said 

6 enzymatic conversion. 

1 45. A method for the preparation of a fucosylated oligosaccharide, the 

2 method comprising contacting an acceptor saccharide with a fucosylation reaction mixture 

3 that comprises GDP-fucose and a fucosyltransferase which transfers fucose from the GDP- 

4 fucose to provide said fucosylated oligosaccharide, wherein the efficiency of said 

5 fucosylation is enhanced by one or more efficiency-enhancing steps selected from the group 

6 consisting of: 

7 1) forming said GDP-fucose by enzymatic conversion of GDP- 

8 mannose to GDP-fucose by: 

9 a) providing a reaction mixture that comprises GDP-mannose, 

1 0 GDP-mannose 4,6-dehydratase, and NADP + ; 

1 1 b) incubating the reaction mixture for a sufficient time to convert at 

1 2 least about 90% of the GDP-mannose to GDP-4-keto-6-deoxymannose; 

13 c) adding to the product of step b) one or more polypeptides having 

14 GDP-4-keto-6-deoxymannose 3,5-epimerase and GDP-4-keto-6-galactose reductase 

1 5 activities; and 
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16 d) incubating the reaction mixture for a sufficient time to convert 

1 7 the GDP-4-keto-6-deoxymannose to GDP-fucose; 

1 8 2) adding pyruvate kinase and a substrate for the pyruvate kinase to the 

1 9 fucosylation reaction mixture, wherein GDP produced as a result of the transfer of fucose 

20 from the GDP-fucose is converted to GTP; and 

21 3) conducting the fucosylation in a reaction medium that comprises a 

22 soluble divalent metal cation, wherein said medium is supplemented with said soluble 

23 divalent metal cation to maintain the concentration of said divalent metal cation between 

24 about 2 mM and about 75 mM. 

1 46. The method of claim 45, wherein the fucosyltransferase is added to the 

2 reaction mixture after at least about 90% of the GDP-4-keto-6-deoxymannose is converted to 

3 GDP-fiicose. 

1 47. The method of claim 45, wherein the fucosyltransferase and the 

2 polypeptides having GDP-4-keto-6-deoxymannose 3,5-epimerase and GDP-4-keto-6- 

3 galactose reductase are added to the reaction mixture after at least about 90% of the GDP- 

4 mannose is converted to GDP-4-keto-6-deoxymannose. 

1 48. The method of claim 45, wherein each of the reaction steps is conducted 

2 in the same reaction vessel. 

1 49. The method of claim 45, wherein which method further comprises 

2 recycling N ADP + or NAD + produced by the reductase activity to NADPH or NADH, 

3 respectively, by including in the reaction mixture of step c) an enzyme that can reduce the 

4 NADP* or NAD - ", and a substrate for the enzyme. 

1 50. The method of claim 49, wherein the enzyme is selected from the group 

2 consisting of alcohol dehydrogenase, glucose dehydrogenase, formate dehydrogenase, 

3 hydrogenase, and glucose-6-phosphate dehydrogenase. 
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1 51. The method of claim 50, wherein the enzyme is glucose dehydrogenase 

2 and the substrate is glucose. 

1 52. The method of claim 45, wherein the acceptor saccharide is selected 

2 from the group consisting of Gaip(l-4)GlcN(R')P-R and Galp(l-3)GlcN(R')P-R> wherein 

3 R is selected from the group consisting of hydrogen, a saccharide, an 

4 oligosaccharide and an aglycon group having at least one carbon atom; and 

5 R 1 is selected from the group consisting of acetyl and allyloxycarbonyl. 

1 53. The method of claim 52, wherein the acceptor saccharide is selected 

2 from the group consisting of NeuAca(2->3)Gaip(l->4)GlcN(R , )P(l->3)Gal|i-OR and 

3 NeuAca(2->3)Galp(l->3)GlcN(R')P(l~>3)Gaip-OR. 

1 54. The method of claim 53, wherein the acceptor saccharide is formed by 

2 sialylating a compound Gaip(l ->4)GlcN(R') p(l -»3)Gaip-OR or Gaip(l->3)GlcN(R') 

3 P(l— >3)Gaip-OR with a sialyltransferase in the presence of a CMP derivative of a sialic acid 

4 using a <x(2,3)sialyltransferase under conditions wherein sialic acid is transferred to the non- 

5 reducing sugar of the compound. 

1 55. The method of claim 54, wherein the compound Gaip(l ->4)GlcN(R') 

2 P(l->3)Gaip-OR or Galp(l->3)GlcN(R') p(l~>3)Gaip-OR is formed by galactosylating a 

3 compound of the formula GlcN(R')P(l ->3)Gaip-OR or GlcN(R')P(l ->3)Galp-OR, 

4 respectively, with a galactosyltransferase in the presence of a UDP-galactose under 

5 conditions sufficient to form the compound. 
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Figure 4 
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Figure 6 
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Figure 14 
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Figure 1 5 
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Figure 16 
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SEQUENCE LISTING 
SEQ ID NO: 1 Amino acid sequence of E. coli YEF B 

MSKQRVF1AGHRGMVGSAIRRQLEQRGDVELVLRTRDELNLLDSRAVHDFFASERI 

DQVYLAAAKVGGIVANNTYPADFIYQNMMIESNIIHAAHQNDVNKLLFLGSSCIYPK 

LAKQPMAESELLQGTLEPTNEPYAIAKJAGIKLCESYNRQYGRDYRSVMPTNLYGPH 

DNFHPSNSHVIPALLRPJ^HEATAQNAPDVVVWGSGTPMREFLHVDDMAAASIHVM 

ELAHEVWLENTQPMLSHINVGTGVDCTIRDVAQTIAKVVGYKGRVVFDASKPDGTP 

RKLLDVTRLHQLGWYHEISLEAGLASTYQWFLENQDRFRG 
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